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INTRODUCTION 

Lung  cancer  continues  to  be  the  leading  cause  of  cancer-related  death  in  both  men  and  women 
in  the  United  States  \  The  majority  of  lung  cancers  are  non-small  cell  lung  cancers  (NSCLCs) 
that  include  squamous  cell  carcinomas  (SCCs)  and  adenocarcinomas  2  Lung  cancer  mortality 
is  high  in  part  because  most  cancers  are  diagnosed  after  regional  or  distant  spread  of  the 
disease  had  already  occurred  and  due  to  the  lack  of  reliable  biomarkers  for  early  detection  and 
risk  assessment  2.  T  he  identification  of  new  effective  early  biomarkers  will  improve  clinical 
management  of  lung  cancer  and  is  linked  to  better  understanding  of  the  molecular  events 
associated  with  the  development  and  progression  of  the  disease. 

It  has  been  suggested  that  histologically  normal-appearing  tissue  adjacent  to  neoplastic  lesions 
display  molecular  abnormalities  some  of  which  are  in  common  with  those  in  the  tumors  3.  This 
phenomenon,  termed  field  of  cancerization,  was  later  shown  to  be  evident  in  various  epithelial 
cell  malignancies,  including  lung  cancer 4,s.  Loss  of  heterozygosity  (LOH)  events  are  frequent  in 
cells  obtained  from  bronchial  brushings  of  normal  and  abnormal  lungs  from  patients  undergoing 
diagnostic  bronchoscopy  and  were  detected  in  cells  from  the  ipsilateral  and  contralateral  lungs 
6.  M  ore  recently,  global  mRNA  expression  profiles  have  been  des  cribed  in  the  normal¬ 
appearing  bronchial  epithelium  of  healthy  smokers  7 .  In  addition,  modulation  of  global  gene 
expression  in  the  normal  epithelium  in  health  smokers  is  similar  in  the  large  and  small  airways 
and  the  smoking-induced  alterations  are  mirrored  in  the  epithelia  of  the  mainstem  bronchus, 
buccal  and  nasal  cavities  8. 

In  this  program,  in  Specific  Aim  1,  high-throughput  microarray  mRNA  expression  analyses  are 
being  performed  on  cytological  specimens  (brushings)  obtained  at  intraoperative  bronchoscopy 
from  the  main  carina  and  main  ipsilateral  bronchus,  as  well  as  on  s  pecimens  obtained  at 
lobectomy  procedures  from  the  main  lobe  bronchus  (adjacent  to  SCCs),  sub-segmental 
bronchus  (adjacent  to  adenocarcinomas)  and  from  the  resected  NSCLC  tumors.  T owards  this 
aim,  we  are  comparing  and  c  ontrasting  global  gene  expression  patterns  across  all  the 
specimens  from  the  entire  field  and  corresponding  NSCLC  tumors.  We  are  deriving  lung 
adenocarcinoma  and  S  CC  field  cancerization  signatures  signifying  the  differential  mRNA 
expression  patterns  between  the  carina  and  the  subsegmental  bronchus  and  main  lobe 
bronchus,  respectively.  In  addition,  similar  expression  profiles  between  the  carina  and  resected 
NSCLC  tumors  will  be  i  ntegrated  with  available  gene  expression  data  of  bronchial  brushings 
from  the  main  carina  isolated  at  various  time  points  post-surgery  from  40  N  SCLC  patients; 
Department  of  Defense  (DoD)  VITAL  patients.  Promising  markers  derived  from  this  study  are 
being  validated  at  the  mRNA  and  protein  level  in  histological  tissue  specimens.  Moreover,  we 
are  currently  performing  RNA-sequencing  and  microarray  profiling  of  nasal  epithelia,  airway 
epithelial  cells  collected  from  both  bronchoscopy  and  I  obectomy  specimens  as  well  as  of 
corresponding  tumors  (NSCLC  patients)  or  benign  lesions  (cancer-free  individuals). 

In  Specific  Aim  2,  we  are  using  laser  capture  microdissection  to  obtain  specific  cell  populations 
(basal  cells  or  type  II  alveolar  cells,  depending  on  the  NSCLC  histology/location)  as  well  as 
premalignant  lesions  and  epithelial  components  of  the  tumors.  These  cell  populations  are  being 
profiled  with  RNA-seq  to  determine  their  gene  expression  signatures  to  increase  our 
understanding  of  premalignancy.  We  are  analyzing  the  gene  expression  profiles  that  are 
associated  with  progression  from  a  beni  gn  cell  population  to  premalignancy  and  w  ith 
progression  from  a  benign  cell  population  to  true  malignancy. 

In  future  studies,  in  Specific  Aim  3,  we  will  use  expression  signatures  and  biomarkers  derived 
from  the  results  of  aims  1  and  2  to  develop  and  test  airway-based  biomarkers  capable  of 
diagnosing  lung  cancer  in  current  or  former  smokers  using  minimally  invasive  sites. 
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This  report  details  the  progress  made  during  the  second  year  of  research. 


COMPREHENSIVE  ANNUAL  PROGRESS  REPORT  FROM  LEAD  PI  (DR.  STEVEN 

DUBINETT) 

Molecular  Profiles  for  Lung  Cancer  Pathogenesis  and  Detection  in  U.S.  Veterans 

Specific  Aim  1:  To  increase  our  understanding  of  the  molecular  basis  of  the 

pathogenesis  of  lung  cancer  in  the  “field  cancerization”  that 
develops  in  current  and  former  smokers. 

Summary  of  Research  Findings 

A.  Gene  and  protein  expression  analysis  of  bronchial  epithelial  samples  obtained  from 
bronchoscopy  from  NSCLC  patients  (Sub-specific  Aim  1A,  1C  and  ID):  This  analysis  was 
performed  on  samples  obtained  from  patients  enrolled  to  the  Vanguard  clinical  trial  supported 
by  the  DoD  VITAL  grant  (W81XWH-04-1-0412,  PI  Dr.  W.K.  Hong),  and  the  gene  profiling  work 
and  analysis,  and  t  he  protein  validation  study,  was  partially  supported  by  the  Lung  Cancer 
Consortium  grant  reported  here  (W81XWH-1 0-1 -1007). 

Gene  Expression  Analysis:  Patients  on  t  he  prospective  Vanguard  study  had  def  initively 
treated  ES  (I/ll)  NSCLC  and  were  current  or  former  smokers.  Patients  had  bronchoscopies  with 

brushings  obtained  from  the  main  carina  (MC), 
airways  adjacent  (ADJ)  to  the  previously 
resected  tumor  and  from  airways  distant  from 
the  tumor  in  the  ipsilateral  (NON-ADJ)  and 
contralateral  (CONTRA)  lung  at  baseline,  12, 
24  and  36  months  following  resective  surgery 
(Figure  1).  Nineteen  patients  were  selected  for 
the  study  based  on  airway  sampling  of  at  least 
five  different  regions  per  time  point  and 
continuously  up  to  24  or  36  months  (391  airway 
samples  from  nineteen  patients).  Data  from  the 
gene  expression  analysis  have  been  described 
and  detailed  in  the  previous  annual  report 
(Year  1).  Microarray  analysis  identified  gene  features  (n=1165)  that  were  non-uniform  by  site 
and  differentially  expressed  between  airways  adjacent  to  tumors  relative  to  more  distant 
samples  as  well  as  those  (n=1395)  that  were  significantly  altered  with  time  up  to  three  years.  In 
addition,  gene-interaction  networks  mediated  by  PI3K  and  ERK1/2  were  modulated  in  adjacent 
compared  to  contralateral  airways  and  the  latter  network  with  time.  Key  markers  of  the  identified 
gene-interaction  networks  were  validated  at  the  protein  level  by  immunohistochemical  analysis. 
The  study  highlighted  spatial  and  temporal  cancer-associated  expression  alterations  in  the 
molecular  field  of  injury  of  early  stage  NSCLC  patients  after  definitive  surgery  that  warrant 
further  validation  in  independent  studies.  These  findings  were  submitted  and  have  been 
accepted  for  publication  and  are  currently  in  press  (see  Reportable  Outcomes  and  Appendix). 

Protein  Expression  Analysis:  Our  findings  on  the  modulation  of  PI3K-  and  ERK1/2-mediated 
networks  by  site  and  time  after  surgery  prompted  us  to  examine  the  immunohistochemical  (IHC) 
expression  of  surrogate  markers  of  both  signaling  cascades  in  corresponding  formalin-fixed 
paraffin  embedded  (FFPE)  airway  biopsies.  We  sought  to  examine  expression  of  AKT 
phosphorylated  at  Threonine308  since  phosphorylation  of  this  amino  acid  is  well  known  to  occur 


Analysis  of  SITE-dependent 
gene  expression  patterns  (ADJ, 
NON-ADJ,  MC,  CONTRA) 

^  Global  gene  expression 
™  analysis 

Analysis  of  TIM  E-dependent 
gene  expression  patterns 
(baseline,  12h  24  and  36 
months. 

Figure  1.  Schematic  illustrating  sampling  and  collection  of 
bronchial  brushes  from  different  airway  sites  at  different  time 
points  following  resective  surgery  from  early  stage  smoker 
NSCLC  patients  enrolled  in  the  phase  II  cancer  surveillance 
Vanguard  study. 
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through  phosphoinositide-dependent  kinase  1  (PDPK1 )  following  PI3K  activation.  We  assessed 
the  immunohistochemical  expression  of  phospho-AKT(Thr308)  and  phos  pho- 
ERK1/2(Thr202/Tyr204)  in  available  and  ev  aluable  histologically  normal  bronchial  epithelia 
biopsies  (n=324)  corresponding  to  the  brushings  analyzed  by  expression  profiling.  The  intensity 
and  extent  of  cytoplasmic  and  nuclear  pAKT-Thr308  immunostaining  was  evaluated  using  a 
light  microscope  (magnification,  x20).  Cytoplasmic  expression  was  quantified  using  a  four-value 
intensity  score  (0,  none;  1,  weak;  2,  moderate  and  3,  strong)  and  the  percentage  (0-100%)  of 
the  extent  of  reactivity).  A  final  cytoplasmic  expression  score  for  pAKT-Thr308  was  obtained  by 
multiplying  the  intensity  and  reactivity  extension  values  (range,  0-300).  Nuclear  expression  was 
quantified  using  the  percentage  of  extent  of  reactivity  which  gave  rise  to  a  nuclear  expression 
score  for  pAKT-Thr308  (range,  0-100).  Immunoreactivity  of  phospho-AKT  (min  0,  max  300)  was 
variable  as  depicted  in  the  representative  photomicrographs  in  Figure  2A  and  was  detected  in 
both  the  cytoplasm  and  nucleus  of  normal  bronchial  epithelia  (NBE)  (Figure  2A).  IHC  analysis 
demonstrated  that  cytoplasmic  (p<0.0001)  and  nuclear  (p=0.01)  phospho-AKT  statistically 
significantly  increased  with  time  up  to  three  years  in  all  NBE  (Figure  2B)  with  highest 
expression  at  the  36  month  time  point.  Nuclear  phosphorylated  AKT  was  also  statistically 
significantly  increased  in  adjacent  NBE  compared  to  airways  relatively  more  distant  from  tumors 
in  the  mixed-effects  model  (p=0.04)  (Figure  2B).  Immunoreactivity  of  phospho-ERK1/2  was  also 
variable  (min  0,  max  209)  and  mainly  cytoplasmic  (Figure  2C).  IHC  analysis  demonstrated  that 
phosphorylated  ERK1/2  was  statistically  significantly  elevated  in  adjacent  NBE  (p=0.03)  and 
significantly  increased  with  time  up  to  three  years  in  all  airways  when  averaged  together 
(p=0.02)  (Figure  2D)  in  the  model. 

Notably,  there  was  a  significant 
interaction  (p=0.019)  between  site 
of  NBE  and  t  ime  of  sampling  as 
phospho-ERK1/2  expression  was 
significantly  increased  in  adjacent 
NBE  but  not  in  contralateral 
airways  and  main  carinas  in  the 
model  (Figure  2D)  with  highest 
expression  in  adjacent  and  non  - 
adjacent  (ipsilateral,  green  plot) 
airways.  These  data  demonstrated 
that  like  differential  gene 
expression  profiles  within  the  lung 
airway  field  of  injury,  canonical 
activated  oncogenes  are 
modulated  by  site  from  the 
resected  tumor  and  time  following 
definitive  surgery  in  early  stage 
NSCLC  patients  and  have  been 
also  accepted  for  publication  along 
with  the  gene  expression  data 
above  (Reportable  Outcomes  and 
Appendix). 
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Figure  2.  A.  Representative  photomicrographs  (20x  magnification)  depicting 
strong  (top)  and  weak  (bottom)  phospho-AKT(Thr308)  immunostaining.  B. 
Immunohistochemical  scores  of  cytoplasmic  (left)  and  nuclear  phospho-AKT 
(middle)  were  assessed  for  statistically  significant  differences  by  site  and  time 
in  a  m  ixed-effect  model  and  pi  otted  in  main  carinas  (MC)  and  i  n  adjacent 
(ADJ),  non-adjacent  (ipsilateral,  NON-ADJ)  and  contralateral  (CONTRA) 
airways  with  time.  C.  Representative  photomicrographs  (20x  magnification) 
depicting  strong  (top)  and  weak  (bottom)  phospho-ERK1/2(Thr202/Tyr204) 
immunostaining.  D.  Immunohistochemical  scores  of  phosphorylated  ERK1/2 
levels  were  assessed  for  statistically  significant  differences  by  site  and  time  in 
a  mixed-effect  model  and  repeated  measures  analysis,  site*time,  term  for 
interaction  between  site  and  time  factors.  Error  bars  represent  S.E.  of  the 
mean. 
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B.  Gene  expression  analysis  of  bronchial  epithelial  samples  obtained  from  lobectomy 
specimens  from  NSCLC  patients  ( Field  Cancerization  Study)  (Sub-specific  Aim  1A  and 
1C): 

Gene  Expression  Analysis:  To  increase  our 
understanding  of  the  molecular  basis  of  lung  cancer 
pathogenesis,  we  analyzed  the  transcriptome  profiles 
of  cytological  specimens  (brushings)  obtained  at 
lobectomy  or  pneumonectomy  procedures  from  2-5 
bronchioles  with  differential  proximity  from  resected 
tumors  and  f  rom  resected  lung  tumors  and  nor  mal 
parenchyma  (Figure  3).  Samples  were  obtained  from 
patients  who  did  not  receive  neoadjuvant  therapy 
undergoing  lobectomy  or  pneumonectomy  procedures 
(n=23)  under  an  M  D  Anderson  institutional  review 
board  (IRB)-approved  protocol.  A  summary  of  the 
clinicopathological  characteristics  of  the  studied  patient 
cases  has  been  des  cribed  in  the  previous  annual 
report  (Year  1).  194  samples  were  obtained  from 

surgically  resected  specimens  of  early-stage  (l-IIIA) 

NSCLC  patients  (n=20  cases).  NSCLCs  and  paired 
normal  lung  were  obtained  frozen,  preserved  in 
RNA later  or  by  surface-brushing.  Normal  bronchial 
epithelia  (NBE)  were  collected  by  brushing  bronchial 
structures  using  Cytosoft  cytology  brushes  (Cardinal 
Health).  Histological  assessment  of  NBE  by 
hematoxylin  &  eosin  (H&E)  and  pan  -cytokeratin  IHC 
analysis  confirmed  the  lack  of  preneoplastic  and 
neoplastic  cells  and  revealed  greater  than  90% 
epithelial  content,  respectively  (Figure  4).  Total  RNA  of 
all  samples  was  isolated  using  the  miRNeasy  Mini  kit 
following  homogenization  of  tissues  and  brushing 
collections.  RNA  quality  was  assessed  by  the  28S/18S 
ribosomal  RNA  ratio  and  by  RNA  integrity  number 
(RIN)  using  the  Agilent  Bioanalyzer.  Processed  RNA 
samples  were  hybridized  to  Affymetrix  GeneChip® 

Human  Gene  1.0  ST  arrays  and  were  then  scanned 
using  the  GeneChip®  Scanner  3000  f  rom  Affymetrix 
(Santa  Clara,  CA)  to  yield  raw  image  files  that  were 
subsequently  converted  to  probe  set  data.  Raw  data 
were  normalized  and  processed  using  the  robust  multi¬ 
chip  algorithm  (RMA)  algorithm.  We  then  applied  linear  mixed-effects  models  and  ANOVA- 
Tukey  tests  to  identify  differentially  expressed  genes  among  groups  based  on  feature-by- 
featured  t-test  p-values  and  false  discovery  rates  (FDRs). 


Figure  3.  Schematic  illustration  of  the  lung 
airway  for  molecular  mapping  of  field 
cancerization  by  high-throughput  profiling. 

Tumor  (T)  and  nor  mal  parenchyma  (NP) 
samples  were  obtained  by  brushing, 
preservation  in  RNA  later  and  snap-freezing  in 
liquid  nitrogen.  Airway  samples  (B1  to  B5)  were 
obtained  by  brushing. 


NSCLC  Normal  lung  Airway 


B  Pan-cytokeratin  Negative  control 


Figure  4.  Representative  H&E  analysis  (A)  of  an 
NSCLC  (left),  paired  normal  lung  tissue  (middle) 
and  airway  (right).  B.  Representative  cytokeratin 
staining  of  an  airway/bronchial  structure  showing 
positive  immunoreactivity. 
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We  then  sought  to  derive  gene  expression  signatures 
signifying  genes  significantly  and  c  oncurrently 
differentially  expressed  between  both  tumors  and 
airways  compared  to  normal  lung  tissue  (field 
cancerization  signatures).  We  first  analyzed  tumors, 
airways  and  normal  lung  pertaining  to  NSCLC  cases 
(n=20)  excluding  the  three  carcinoid  cases.  Following 
linear-mixed  effects  model  coupled  with  ANOVA-Tukey 
analysis,  we  identified  457  and  1204  g  enes  to  be  up- 
regulated  and  down-regulated,  respectively,  between 
both  NSCLCs  and  ai  rways  compared  to  normal  lung 
tissue  based  on  a  s  tatistical  threshold  of  a  p  - 
value<0.01  as  well  as  a  1.5  fold-change  in  expression. 
Hierarchical  clustering  analysis  demonstrated  that 
these  genes  were  able  to  separate  NSCLC  tumor 
samples  and  airway  samples  from  normal  lung  tissues, 
the  latter  being  segregated  in  a  s  eparate  cluster 
(Figure  5).  We  then  derived  such  field  cancerization 
signatures  when  analyzing  squamous  cell  carcinoma 
and  lung  adenocarcinoma  cases  (the  two  major 
subtypes  of  NSCLC)  separately.  Similar  analysis  was 
performed  using  the  same  selection  criteria 
mentioned  above.  We  derived  field  cancerization 
signatures  denoting  genes  significantly  and 
concordantly  differentially  expressed  between  both 
lung  tumors  and  ai  rways  compared  to  paired 
uninvolved  normal  lung  tissue  for  both  lung  SCC 
(n=5)  and  adenocarcinoma  (n=14)  cases  that  were 
comprised  of  1,803  and  1,735  genes,  respectively 
(Figures  6A  and  6B).  We  then  sought  to  functionally 
analyze  the  molecular  field  cancerization  in  NSCLC. 
We  performed  pathways  and  gene-interaction 
network  analysis  of  the  1,661  differentially  expressed 


Figure  5.  Field  cancerization  expression 
profiles  in  early-stage  NSCLC.  Hierarchical 
clustering  of  genes  significantly  differentially 
expressed  between  both  tumors  and  ai  rways 
compared  to  normal  lung  in  NSCLC  (n=1,661). 
Columns  represent  samples  (n=194  samples  from 
20  cases)  and  rows  represent  gene  features  (red, 
up-regulated;  green,  down-regulated). 


Normal  lung 

SCC  cases  ®  Airway  ADC  cases 


Figure  6.  Field  cancerization  expression  profiles  in 
early-stage  lung  SCCs  and  adenocarcinomas. 

Hierarchical  clustering  of  genes  significantly  differentially 
expressed  between  both  tumors  and  airways  compared  to 
normal  lung  in  SCC  (A,  n=1,803  genes)  and 

adenocarcinoma  (ADC)  (B,  n=1,735  genes)  cases.  Columns 
represent  samples  and  rows  represent  gene  features  (red, 
up-regulated;  green,  down -regulated). 


genes  described  in 
Figure  5  using  Ingenuity 
Pathways  Analysis 
(Redwood,  CA).  The  top 
functional  pathway  and 
gene-interaction  network 
are  represented  in 
Figure  7A  and  Figure 
7B,  respectively.  Certain 
molecules  within  the  top 
functional  pathway 
(MET,  TGFBR2)  and 
gene-interaction  network 
(i GPRC5A ,  VIPR1)  are 
highlighted  in  bold  as 
they  have  been  s  elected 
for  further  confirmation 


Figure  7.  Functional  pathways  analysis  of  NSCLC  molecular  field  cancerization. 

Functional  pathway  and  gene-interaction  network  analysis  of  the  NSCLC  field  cancerization 
signature  derived  in  Figure  5.  The  top  functional  pathway  and  gene-interaction  network  are 
presented  in  A  and  B,  respectively.  Molecules  selected  for  further  expression  validation  are 
in  bold.  Red  =  upregulated  genes,  Green  =  downregulated  genes. 
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and  validation  of  expression  in  subsequent  studies  (see  below).  It  is  noteworthy  that  functional 
analysis  revealed  that  the  identified  molecular  field  cancerization  in  NSCLC  was  comprised  of 
typical  cancer-associated  pathways  and  net  works  such  as  embryonic  stem  cell,  eicosanoid, 
retinoic  acid  receptor  (RAR),  NANOG  stem  cell,  NF-kB  and  PTEN  signaling  pathways  and  gene 
networks.  These  findings  highlight  expression  patterns  and  pa  thways  that  are  typically 
deregulated  in  overt  tumors  but  also  prevalent  in  histologically  normal  airway  epithelia,  and 
therefore,  may  play  important  roles  in  the  pathogenesis  of  lung  adenocarcinomas  and  SCCs. 

The  above  findings  highlight  expression  patterns  and  pa  thways  in  the  molecular  field 
cancerization  of  NSCLC  that  may  offer  both  detection  markers  as  well  as  therapeutic  targets  to 
guide  personalized  chemoprevention  in  early  stage  NSCLC  patients.  To  explore  this  concept, 
we  assessed  the  relevance  of  the  localized  field  cancerization  expression  profiles  identified  to 
those  of  large  airways  in  lung  cancer  patients  and  high-risk  smokers.  We,  therefore,  analyzed 
the  field  cancerization  profiles  from  Spira  et  al 9  comprised  of  129  proximal  airway  samples.  The 
gene  expression  raw  data  of  airway  epithelium  of  smokers  with  suspect  lung  cancer  were  re¬ 
normalized  to  produce  gene-level  expression  values  using  RMA  10  in  the  affy  package  11  of  the 
Bioconductor  software  suite  (version  2.4.1) 12  and  Entrez  gene-specific  probe  set  mapping  from 
BrainArray  (version  11.0.1) 13.  A  linear  model  was  then  used  to  adjust  the  expression  estimates 
for  the  mean  z-score  quality  metric  previously  described  in  the  original  report.  We  then 
performed  Gene  Set  Enrichment  Analysis  (GSEA)  14  and  a  list  of  all  Entrez  gene  identifiers 
within  the  field  cancerization  signature  was  ranked  according  to  the  Student  t  statistic  computed 
between  proximal  airways  from  cancer  and  non-cancer  patients  from  the  original  report  by  Spira 
et  al.  This  list  was  then  used  to  perform  a  pre-ranked  GSEA  analysis  to  identify  which  of  the 
field  cancerization  gene  sets  were  significantly  enriched  among  genes  that  were  up-  or  down- 
regulated  in  the  normal  bronchial  epithelium  of  smokers  with  suspect  lung  cancer.  A  leading 
edge  gene  set  was  then  derived  that  was  comprised  of  299  genes  that  were  concordantly 
modulated  between  NSCLCs  and  ai  rways  compared  to  normal  lung  in  the  localized  field 
cancerization  as  well  as  between  proximal  large  airways  of  cancer  patients  compared  to  non¬ 
cancer  patients  (Figure  8). 

Side  by  side  hierarchical 
cluster  analysis  using  the 
leading  edge  genes  of  the 
localized  field  cancerization 
samples  (left  panel)  and  of  the 
large  airways  of  cancer  and 
non-cancer  patients  (right 
panel)  revealed  concordance 
in  differential  expression 
among  the  sub  groups  (Figure 
8).  These  data  suggest  that 
expression  profiles  from  the 
localized  field  cancerization  of 
resected  NSCLC  specimens 
may,  in  part,  be  relevant  to 
differential  field  of  injury 
molecular  profiles  between 
cancer  and  non-cancer 
patients  and  may  comprise 
potential  targets  that  can  be 
used  for  both  detection  and 
chemoprevention  strategies. 
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Figure  8.  Analysis  of  the  localized  field  cancerization  in  uninvolved  mainstem 
bronchi  of  patients  with  suspect  lung  cancer.  GSEA  analysis  was  performed  to 
identify  genes  (n=299)  differentially  expressed  between  tumors  and  airways 
compared  to  normal  lung  in  the  localized  field  cancerization  (FC)  of  NSCLCs  that  are 
also  significantly  and  c  oncordantly  differentially  expressed  between  mainstem 
bronchi  of  cancer  and  non -cancer  patients.  Hierarchical  cluster  analysis  using  the 
299  leading  edge  genes  was  performed  side  by  side  in  the  localized  FC  (left  panel) 
and  mainstem  bronchi  of  cancer  (green)  and  non-cancer  (grey)  patients  (right  panel). 
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We  then  sought  to  further  study 
the  transcriptomic  architecture  of 
the  localized  field  cancerization  to 
determine  whether  the  molecular 
field  cancerization  exhibits  site  or 
distance  from  tumor-dependent 
expression  patterns.  Ordinal 
logistic  analysis  of  airway  samples 
alone  identified  genes  significantly 
progressively  and  di  fferentially 
expressed  by  distance  from  the 
tumors  (n=422)  (Figure  9A). 

Functional  pathways  analysis  of 
the  genes,  using  Ingenuity 
Pathways  Analysis  (Redwood,  CA) 
demonstrated  that  canonical  cancer-associated  pathways,  such  as  eukaryotic  initiation  factor, 
p70S6K  kinase,  polo-like  kinase  (PLK),  gluconeogenesis  and  mammalian  target  of  rapamycin 
(mTOR)  signaling,  were  significantly  and  progressively  modulated  in  the  airway  epithelia  by 
distance  from  the  tumor  (increased  with  shorter  distance  from  tumors)  (Figure  9B).  We  then 
derived  a  bioinformatic  index  score,  signifying  the  extent  of  the  site-dependent  modulation  of  the 
genes  in  the  airway  cancerization,  by  computing  the  difference  of  expression  values  of  the  two 
clusters  exhibited  in  the  heatmap  in  Figure  9A  across  samples.  The  score/index  quantitatively 
assessed  the  site-dependent  effect  and  was  increased  with  shorter  distance  of  the  airways  from 
the  tumors  (Figure  10A).  Notably,  the  extent  of  the  site-dependent  molecular  field  cancerization 
effect  was  more  pronounced  in  lung  SCC  cases  and  i  ncreased  at  a  much  larger  rate  with 
shorter  distance  from  the  tumors  in  this  NSCLC  subtype  compared  to  adenocarcinomas 
(Figures  10B  and  IOC)  which  may  be  related  in  part  to  the  proximal  location  of  the  sampled 
airways  to  SCCs  relative  to  adenocarcinomas.  However,  it  is  noteworthy  that  we  did  not  find 
significant  differences  in  site-dependent  expression  of  members  of  the  surfactant  family  such  as 
SFTPB  and  SFTPC. 


Since  we  performed  the  regression  analysis  on 
airways  alone,  we  then  deemed  it  worthy  to 
determine  whether  the  identified  significant  site- 
dependent  gene  expression  modulation  in  the 
airways  follow  the  same  pattern  between 
matched  tumors  and  u  ninvolved  normal  lung 
tissue.  Importantly,  the  site-dependent  genes 
signified  by  the  cancerization  index  score  were 
collectively  significantly  and  concordantly 
modulated  between  tumors  and  pai  red 
uninvolved  normal  lung  tissues  (Figure  10D).  It 
is  important  to  note  that,  although  the  site- 
dependent  molecular  field  cancerization 
expression  profiles  were  concordantly 
modulated  between  corresponding  NSCLCs 
and  matched  normal  lung  tissue,  they  did  not 
significantly  correlate  with  expression  profiles  of 
the  proximal  large  airways  (mainstem  bronchi) 
of  patients  with  suspect  lung  cancer.  GSEA 
analysis  of  the  aforementioned  422  genes 


2  3  4  5 

Airways 

Figure  10.  Site-dependent 
cancerization  is  more  pronounced  in  lung  SCCs  relative 
to  adenocarcinomas  and  informs  of  corresponding 
tumor  profiles.  Boxplots  depicting  site-dependent  molecular 
airway  field  cancerization  modulation  in  early-stage  NSCLCs 
(A),  adenocarcinomas  (B),  lung  SCCs  (C)  and  bet  ween 
paired  NSCLCs  and  normal  lung  tissue  (D).  Airway  distance 
from  tumor  is  numerically  indicated  with  a  range  of  1 
(closest)  to  5  (farthest). 
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Figure  9.  Site-dependent  modulation  of  the  molecular  field  cancerization 
in  early-stage  NSCLC.  A.  Ordinal  logistic  regression  analysis  of  airways 
identifying  422  genes  modulated  in  expression  by  distance  from  the  tumor.  B. 
Functional  pathways  analysis  of  the  genes  using  Ingenuity  Pathways  Analysis 
software. 
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(clustered  in  Figure  9)  did  not  reveal  significant  enrichment  of  this  expression  signature  in  the 
published  dataset  by  Spira  et  al  9.  These  findings  suggest  that  unlike  global  localized  field 
cancerization  signatures  we  described  above,  site  (from  tumor)-dependent  expression 
differences  in  the  localized  molecular  field  cancerization  may  not  be  relevant  to  the  extended 
airway  field  of  injury.  Nonetheless,  it  is  plausible  to  assume  that  the  site-dependent  localized 
molecular  field  cancerization  profiles  comprise  expression  profiles  with  typical  cancer- 
associated  signaling  pathways  and  inform  of  tumor  expression  patterns  and,  thus,  of  potential 
early  mechanisms  of  lung  cancer  pathogenesis  commencing  in  normal  airway  epithelia. 


Expression  validation  studies:  We  then  sought  to  validate,  in  part,  expression  of  field 
cancerization  markers.  We  selected  field  cancerization  markers  that  displayed  site-dependent  or 
-independent  differential  expression  following  microarray  profiling  analysis  of  the  localized  field 
cancerization  based  on  both  statistical  criteria  and  functional  pathways  analysis.  Out  of  the 
NSCLC  cases  studies, 

18  comprised  samples 
with  enough  RNA  left 
after  array  profiling  for 
performing  quantitative 
real-time  polymerase 
chain  reaction 

(QRTPCR)  analysis. 

Microarray  profiling 


identified 

associated 


lysosome 

protein 


transmembrane  4  beta 
(, LAPTM4B )  and 

neuropilin  (NRP)  and 
tolloid  (TLL)-like  2 
(NET02)  to  exhibit 
site-dependent 
differential  expression 
(increasing  in  airways 
closer  to 

corresponding  tumors) 
in  the  localized  field 
cancerization.  Both 
genes  were  not 
identified  in  the  global 
field  cancerization 
signatures  derived 
(Figures  5  and  6  ) 

when  all  samples  were 
analyzed  independent 
of  site.  Transforming 
growth  factor  receptor 
beta  II  (TGFBR2)  was 
found  to  be  decreased 
in  the  molecular  field 
cancerization  in  a  site- 
independent  manner 
and  vasoactive 
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VIPRi  expression  by  QRTPCR 
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Figure  11.  QRTPCR  analysis  of  airway  field  cancerization  markers.  Expression  of  LAPTM4B  (A), 
NET02  (B),  TGFBR2  (C)  and  VIPRI  (D)  was  analyzed  by  QRTPCR  in  18  out  of  20  cases  with  sufficient 
RNA  from  samples.  QRTPCR  analysis  was  performed  in  a  site-dependent  manner  (left  panels)  and  in  a 
site-independent  manner  comparing  tumors,  airways  and  normal  lung  tissues  (middle  panels).  Correlation 
between  expression  quantified  by  microarray  and  QRTPCR  analyses  was  statistically  assessed  by 
Spearman  rank  (right  panels). 
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intestinal  peptide  receptor  1 
(VIPR1)  exhibited  both  site- 
dependent  and  i  ndependent 
decreased  expression  in  the 
molecular  field  cancerization. 

We  performed  QRTPCR  analysis 
of  LAPTM4B,  NET02,  TGFBR2 
and  VIPR1  on  all  evaluable 
cases  (Figures  11  A-D). 

QRTPCR  analysis  confirmed  the 
microarray  findings  of  the 
differential  expression  pattern  of  all  four  genes  in  the  molecular  field  cancerization  (Figures 
11  A-D).  LAPTM4B  and  NET02  were  significantly  increased  in  airways  closer  to  tumors 
compared  to  more  distant  normal  epithelia  (p<0.05)  (Figures  11A-B,  left  panels).  Similar  to  the 
array  findings,  LAPTM4B  and  NET02,  although  increased  in  tumors  compared  to  normal  lung 
tissue,  exhibited  similar  levels  of  expression  among  airways,  when  averaged  all  together,  and 
normal  lung  (Figures  11A-B,  middle  panels).  Moreover  and  closely  similar  to  the  microarray 
data,  TGFBR2  and  VIPR1  mRNA  expression  were  decreased  in  tumors  and  airways  compared 
to  normal  lung  tissue  (Figures  11C-D,  middle  panels).  Whereas  TGFBR2  did  not  exhibit  site- 
dependent  differential  expression  patterns  (Figure  11C,  left  panel),  VIPR1  was  progressively 
decreased  in  airways  with  closer  distance  from  corresponding  tumors  (Figure  11D,  left  panel). 
Importantly,  we  found  significant  and  high  correlation  between  expression  of  the  four  genes 
analyzed  by  microarray  and  expression  quantified  by  QRTPCR  (all  p<0.0001)  (Figures  11  A-D, 
right  panels).  We  next  performed  western  bot  analysis  of  the  20  N  SCLC  field  cancerization 
case  tumors  and  corresponding  uninvolved  normal  lung  tissues  for  assessing  expression  of  the 
protein  products  coding  for  NET02  and  TGFBR2.  Western  blot  analysis  confirmed  the  tumor 
versus  normal  differential  expression  of  the  field  cancerization  markers  at  the  protein  level 
(Figure  12).  It  is  noteworthy  that  we  were  not  successful  to  assess  the  protein  expression  of 
LAPTM4B  as  the  limited  antibodies  available  for  this  recently  identified  gene  did  not  work  well  in 
our  hands.We  also  confirmed  expression  of  additional  field  cancerization  markers.  Microarray 
analysis  revealed  that  the  MET  oncogene  was  increased  in  the  molecular  field  cancerization, 
and  did  not  exhibit  site-dependent  differential  expression  features.  We  analyzed  MET  mRNA 
and  protein  expression  by 
QRTPCR  and 

immunohistochemical 
analysis,  respectively,  in  an 
independent  set  of  cases 
and  samples  we  had  not 
profiled  (n=6  cases). 

QRTPCR  validated  the 
microarray  findings  in  the 
independent  set  and 
demonstrated  that  MET 
mRNA  expression  was 
significantly  higher  in  both 
lung  tumors  and  airways 
compared  to  normal  lung 
tissue  ( p<0.0001 )  (Figures 
13A-B). 

Immunohistochemical 
analysis  confirmed  the  array 
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Figure  13.  Validation  of  increased  expression  of  the  MET  oncogene  in  the  field 
cancerization.  Quantitative  real-time  PCR  analysis  of  MET  mRNA  expression  in 
adenocarcinomas  and  matched  airways  and  normal  lung  (A  and  B).  C.  Representative 
MET  immunohistochemical  expression  in  a  case  comprised  of  a  tumor,  uninvolved 
normal  lung  and  five  bronchial  structures.  D.  Correlation  plot  of  MET  mRNA  and  protein 
expression  in  the  same  cases  by  Pearson  correlation. 
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Figure  12.  Western  blotting  analysis  of  field  cancerization  markers  in 
NSCLCs  and  corresponding  normal  lung  tissues.  NSCLC  and  m  atched 
normal  lung  tissues  (20  field  cancerization  case  pairs)  were  lysed  and 
homogenized  after  which  20  pg  of  the  total  protein  homogenates  were  subjected 
to  SDS-PAGE.  Membranes  were  incubated  with  primary  antibodies  raised 
against  NET02  and  TGFBR2.  Membranes  were  also  incubated  with  antibody 
against  proliferating  cell  nuclear  antigen  (PCNA)  to  confirm  tumor  and  normal 
samples  and  with  (3-actin  to  ensure  equal  loading. 
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and  QRTPCR  findings  at  the  protein  level  and  demonstrated  increased  total  MET 
immunohistochemical  expression  in  tumor  and  normal  bronchial  epithelial  histological  tissue 
sections  compared  to  those  of  normal  lung  tissue  (Figure  13C).  In  addition  and  importantly,  we 
found  statistically  significant  correlation  (p<0.001)  between  MET  mRNA  assessed  by  QRTPCR 
and  protein  expression  analyzed  by  IHC  in  the  same  samples  studied  (Figure  13D). 
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Our  microarray  analysis  also  pinpointed  to  a  decreased  expression  of  the  lung-specific  tumor 
suppressor  gene  G-protein  coupled  receptor  family  C,  group  5,  member  A  ( GPRC5A )  in  tumors 
and  airways  compared  to  uninvolved  normal  lung  tissue.  We  performed  QRTPCR  analysis  of 
GPRC5A  mRNA  expression  in  NSCLCs,  normal  lung  tissue  and  bronchial  epithelia  from  six 
independent  cases  we  had  not  profiled.  QRTPCR  analysis  confirmed  the  microarray  findings 
and  demonstrated  that  GPRC5A  expression  was  significantly  decreased  in  the  molecular 
localized  field  cancerization  (Figures  14A-B).  GPRC5A  was  identified  as  a  lung-specific  tumor 
suppressor  gene  evidenced  by  spontaneous  adenocarcinoma  formation  in  mice  with  knockout 
of  both  of  the  gene’s  alleles  15.  More  recently,  GPRC5A  was  shown  to  exert  its  tumor 
suppressive  function,  in  part,  by 
inhibition  of  nuclear  factor-kappa  B 
(NFkB)  and  downstream  inflammation 
16.  Therefore,  we  sought  to  analyze 
GPRC5A  expression  in  the  molecular 
field  cancerization  associated  with 
chronic  obstructive  pulmonary  disease 
(COPD),  a  risk  factor  for  lung  cancer 
that  is  typically  associated  with 
inflammation  17.  GPRC5A  expression 
was  assessed  in  histological  tissue 
specimens  of  NSCLC  and  normal 
bronchial  epithelia  including  those 
from  lung  cancer-free  COPD  and  from 
NSCLC  patients.  Representative 
photomicrographs  of  GPRC5A 
immunohistochemical  staining  in  the 
different  normal  bronchial  epithelia  is 
depicted  in  Figure  15A.  GPRC5A 
immunoreactivity  appeared  to  be 
highest  in  normal 
bronchial  epithelia 
from  lung  disease- 
free  never-  and  ever- 
smokers  and 

successively 
significantly 
decreased  in  NBE  of 
COPD  patients  who 
are  cancer-free  and 
lowest  in  NBE  of 
patients  with  both 
COPD  and 

adenocarcinoma  or 
SCC  (Figure  15B).  A 
general  linear  model 
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Figure  14.  QRTPCR  analysis  of  GPRC5A  expression  in  the  localized 
field  cancerization.  Total  RNA  isolated  from  tumors,  normal  lung  tissue 
and  normal  bronchial  epithelia  from  six  NSCLC  cases  (A)  was  analyzed 
by  QRTPCR  for  GPRC5A  expression.  B.  Box  plot  showing  average 
GPRC5a  expression  in  NSCLCs,  bronchial  epithelia  and  normal  lung 
tissue  statistically  analyzed  by  the  Kruskal-Wallis  test. 
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Figure  15.  GPRC5A  expression  in  COPD-associated  field  cancerization.  A.  Representative 
photomicrographs  of  GPRC5A  expression  in  NBE  from  disease-free  never-  and  ever-smokers 
(upper  left  and  right,  respectively)  and  in  NBE  from  cancer-free  COPD  patients  (bottom  left)  and 
NSCLC  patients  with  COPD  (bottom  right).  B.  Box-plots  depicting  quantification  of  GPRC5A 
immunohistochemical  expression  represented  in  (A).  Main  P-value  signifies  statistical 
significance  of  reduced  GPRC5A  expression  in  the  sequential  pathogenesis  of  COPD- 
associated  NSCLC  and  among  the  five  groups  by  the  general  linear  model  and  Kruskal-Wallis 
test. 
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demonstrated  a  significant  gradual  decrease  of  GPRC5A  expression  from  NBE  of  disease-free 
never-smokers  to  NBE  from  patients  with  COPD  and  adenocarcinoma  or  SCC  (p<0.0001) 
(Figure  15B).  The  mean  (177.31  ±  30.66)  and  median  (195;  min,  130;  max,  220)  of  GPRC5A 
expression  in  NBE  of  lung  disease-free  never-smokers  were  higher  than  that  of  smokers  (mean, 
155.28  ±  34.15,  median,  147.5;  min,  100;  max,  230)  although  the  difference  was  not  statistically 
significant  (Figure  15B).  Moreover,  GPRC5A  expression  was  significantly  higher  in  NBE  of  lung 
disease-free  smokers  relative  to  NBE  in  cancer-free  COPD  patients  (p=0.007),  which  in  turn 
was  significantly  higher  compared  to  the  expression  in  NBE  from  COPD  patients  with 
adenocarcinoma  (p<0.0001)  or  SCC  (p<0.0001)  (Figure  15B).  Furthermore,  although  GPRC5A 
expression  was  lower  in  NBE  of  COPD  patients  with  SCC  compared  to  those  with  COPD  and 
adenocarcinoma,  the  difference  was  not  statistically  significant  (Figure  15B).  These  findings 
suggest  that  reduced  expression  of  the  GPRC5A  tumor  suppressor  may  be  implicated  in  the 
pathogenesis  of  NSCLC  associated  with  inflammatory  COPD.  Our  findings  on  the  differential 
expression  of  GPRC5A  in  the  molecular  field  cancerization  including  that  associated  with  COPD 
have  been  ac  cepted  for  publication  and  ar  e  in  press  (see  Reportable  Outcomes  and 
Appendix). 

Functional  studies:  As  mentioned 
above,  we  confirmed  and  validated 
the  differential  expression  of  selected 
field  cancerization  markers  in 
matched  NSCLCs,  bronchial  epithelia 
and  uninvolved  normal  lung  tissue. 

Some  of  the  field  cancerization 
markers  exhibit  well-reported  roles  in 
lung  cancer  pathogenesis  (e.g.,  MET, 

GPRC5A)  whereas  other  validated 
molecules  are  either  recently  shown 
to  function  in  lung  carcinogenesis 
[TGFBR2  18'19]  or  have  not  been 
reported  to  exhibit  aberrant  function  in 
the  pathogenesis  of  lung 
malignancies  (e.g.,  LAPTM4B  and 
NET02).  We  were  prompted  to 
assess  the  expression  and  potential 
functional  relevance  of  the  latter 
group  of  under  studied  genes  in  an  in  vitro  model  of  lung  carcinogenesis  comprised  of  normal 
(NHBE),  immortalized  (BEAS-2B  and  1799)  and  transformed  (1198)  and  tumorigenic  (1170) 
bronchial  and  lung  epithelial  cells  19. 

QRTPCR  analysis  demonstrated  that,  in  accordance  with  our  findings  on  differential  expression 
patterns  in  the  molecular  field  cancerization,  LAPTM4B  and  NET02  were  both  up-regulated  in 
tumorigenic  lung  epithelial  cells  compared  to  their  normal  counterparts  whereas  TGFBR2  was 
down-regulated  in  the  tumorigenic  cells  (Figures  16A-C).  Moreover,  the  differential  expression 
of  NET02  and  TGFBR2  in  the  utilized  in  vitro  model  was  confirmed  at  the  protein  level  by 
western  blot  analysis  (Figure  16D).  These  findings  demonstrate  that  novel  field  cancerization 
markers  may  have  important  relevance  to  lung  cancer  pathogenesis. 


TGFBR2 


NHBE  BEAS-2B  1799  1198  1170 

Figure  16.  Differential  expression  of  selected  field  cancerization 
markers  in  an  in  vitro  model  of  lung  carcinogenesis.  QRTPCR 
analysis  of  LAPTM4B  (A),  NET02  (B)  and  TGFBR2  (C)  expression  in 
normal  (NHBE),  immortalized  (BEAS-2B  and  1799)  and  transformed  and 
tumorigenic  (1198  and  1170)  lung  epithelial  cells.  D.  Western  blot  analysis 
of  NET02  and  TGFBR2  protein  expression  in  the  in  vitro  model. 
Membranes  were  stained  with  antibody  against  actin  to  ensure  equal 
loading. 
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Microarray  analysis  of  the 
field  cancerization  in  early- 
stage  NSCLC  revealed  that 
LAPTM4B,  a  lysosome 
associate  transmembrane 
putative  oncogene  with  no 
known  role  in  lung 
carcinogenesis  20,  was  among 
the  top  5  site-dependent  field 
cancerization  markers. 

Moreover  and  in  our  hands,  LAPTM4B  was  significantly  elevated  in  NSCLC  and  immortalized 
bronchial  epithelial  cell  lines  compared  to  normal  cells.  We  then  explored  LAPTM4B  expression 
in  a  I  arge  available  array  dataset  of 
NSCLC  cell  lines  from  Dr.  John  D.  Minna 
(University  of  Texas  Southwestern,  Dallas, 

Texas).  Microarray  analysis  of  the  cell 
lines  demonstrated  that  LAPTM4B 
expression  was  significantly  higher  (left 
cluster)  in  a  fraction  of  NSCLC  cell  lines 
compared  to  bronchial  cells  (right 
dendogram  cluster)  (Figure  17).  These 
findings  utilizing  different  cell  culture 
systems  suggest  that  field  cancerization 
markers,  exemplified  by  LAPTM4B,  may 
be  modulated  in  expression  in  the 
pathogenesis  sequence  of  lung  cancer. 

We  then  attempted  to  preliminarily  assess  the  functional  relevance  of  the  field  cancerization 
marker  LAPTM4B  to  NSCLC  malignant  phenotype.  Microarray  analysis  of  the  large  array  of 
NSCLC  cell  lines  (Figure  17)  aided  us  to  select 
cell  lines  with  relatively  high  expression  of  the 
gene.  We  employed  SMARTpool  small  interfering 
RNA  (Dharmacon,  Inc.,  Lafayette,  CO)  that 
consist  of  pooled  siRNAs  targeting  different 
regions  of  the  LAPTM4B  transcript.  Transient 
transfection  of  Calu-6  NSCLC  cells  with 
L4P77W4B-targeting  siRNA  effectively  reduced 
LAPTM4B  expression  (Figure  18A)  and 
significantly  decreased  cell  growth  evidenced  by 
trypan  blue  exclusion  cell  count  (Figure  18B).  The 
effect  of  RNA  interference-mediated  knockdown  of 
LAPTM4B  expression  on  cell  growth  was 
confirmed  in  another  NSCLC  cell  line  (HI 650, 

Figures  19A,  C  and  E)  and  in  the  immortalized 
bronchial  epithelial  cell  line  BEAS-2B  (Figures  19 
B,  D  and  F)  further  demonstrating  that  LAPTM4B 
is  a  positive  mediator  of  cell  growth  in  NSCLC  cell 
lines.  We  sought  to  further  confirm  the  role  of 
LAPTM4B  in  lung  cancer  cell  growth  by 
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Figure  19.  Knockdown  of  LAPTM4B  reduces  cell  growth. 
QRTPCR  (A-B)  and  cell  growth  analysis  by  trypan  blue  exclusion 
(C-D)  of  NSCLC  and  immortalized  bronchial  epithelial  cells 
transfected  with  control  or  LAPTM4B  specific  siRNA.  Phase- 
contrast  micrographs  (x20)  of  control  and  LAPTM4B- siRNA 
transfected  HI 650  (E)  and  BEAS-2B  (F)  cells. 


A  B 


hours 


Figure  18.  Knockdown  of  LAPTM4B  reduces  lung  cancer  cell 
growth.  A.  QRTPCR  analysis  of  LAPTM4B  expression  in  Calu6 
cells  transfected  with  control  siRNA  or  LAPTM4B- specific  siRNA. 
B.  Trypan  blue  exclusion  count  of  cells  at  24,  48  and  72  h  following 
transfection  of  control  or  L4PT/W4B-targeting  siRNA. 


LAPTM4B 


■  Immortalized  HBECS 
C  NSCLC 


Figure  17.  LAPTM4B  expression  in  NSCLC  and  immortalized  bronchial 
epithelial  cell  lines.  Increased  expression  of  LAPTM4B  in  NSCLC  cells  compared 
to  HBECS  by  microarray  analysis. 
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generating  sub-lines  with  stable 
knockdown  of  the  gene  in  the  Calu- 
6  cell  line  with  high  basal  level  of 
LAPTM4B.  E  mpty  vectors  and 
vectors  containing  scrambled  small 
hairpin  RNA  (shRNA)  as  well  as 
shRNA  specific  to  LAPTM4B 
(Origene,  Rockville,  MD)  were 
used  for  generation  of  the  stable 
sub-lines.  Stable  selection  was 
performed  by  incubating 
transfected  cells  for  two  weeks  in 
cell  culture  medium  with  10  pg 
puromycin.  QRTPCR 

demonstrated  significantly  reduced 
LAPTM4B  expression  in  Calu-6 
cells  with  stable  knockdown  of 
LAPTM4B  compared  to  cells  stably 
transfected  with  scrambled  shRNA 
or  empty  vectors  (Figure  20A). 

Moreover,  Calu-6  cells  with  stable 
knockdown  of  LAPTM4B  displayed 
significantly  reduced  cell  growth 
and  anchorage-dependent  colony 
formation  (Figure  20B  and  Figures  20C-D,  respectively).  These  data  demonstrate  that 
LAPTM4B  field  cancerization  marker  is  a  positive  mediator  of  immortalized  and  malignant  lung 
epithelial  cell  growth.  Moreover,  these  findings  are  proof  of  principle  and  demonstrate  that 
functional  characterization  of  field  cancerization  markers  in  the  lung  airway  can  lead  to  the 
identification  of  otherwise  unknown  functional  targets  and  mediators  of  lung  cancer  molecular 
pathogenesis.  Our  findings  on  the  aforementioned  microarray  analysis  of  the  field  cancerization 
as  well  as  on  the  expression  validation  and  functional  studies  of  field  cancerization  markers 
have  been  pr esented  as  an  abstract  and  are  currently  being  prepared  as  a  manuscript  for 
publication  which  we  anticipate  submitting  for  peer-review  by  the  end  o  f  the  year  (see 
Reportable  Outcomes  and  Appendix). 

C.  Collection  of  epithelial  samples  from  both  bronchoscopy  and  lobectomy  specimens 
from  patients  with  lung  cancer  (Sub-specific  Aims  1A  and  1C): 

We  obtained  consent  from  35  study  participants  to  collect  tissue  samples.  From  these  subjects 
we  have  collected  sets  of  bronchoscopy  (nasal  and  bronchial  brushes)  and  lobectomy  (tumor, 
normal  parenchyma,  and  bronchial  brushes)  epithelial  samples  (Table  1). 


Table  1.  Specimen  types  collected 


Specimen  type 

Number  of 
specimens 

Blood 

26 

Bronchial  epithelial  brushings 

145 

Distal  airway  brushings 

3 

Nasal  epithelial  brushings 

33 

Tumor  tissue 

8 

Empty  vector 


Scrambled 

shRNA 


vBKjJi 


/  N 


Figure  20.  Stable  knockdown  of  LAPTM4B  reduces  lung  cancer  cell 
growth  and  colony  formation.  A.  QRTPCR  analysis  of  Calu-6  cells  stably 
transfected  with  empty  vectors  or  vectors  containing  scrambled  shRNA  or 
LAPTM4B-spec\f\c  shRNA.  B.  Trypan  blue  exclusion  method  count  of  the 
stably  transfected  cells.  C-D.  Anchorage-dependent  colony  formation  assay  of 
the  stably  transfected  Calu-6  cells  was  performed  by  seeding  cells  at  a 
seeding  density  of  500  c  ells  per  well  in  six-well  plates.  Representative 
triplicate  images  were  obtained  with  phase-light  microscope  (C)  and  quantified 
to  assess  number  of  colonies  (D). 
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The  samples  are  currently  being  analyzed  by  both  next  generation  RNA-sequencing  (RNA-Seq) 
using  the  lllumina  HiSeq  2000  platform  and  microarray  profiling  using  the  Human  Gene  2.0  ST 
platform  from  Affymetrix.  RNA-Seq  and  microarray  analysis  are  being  performed  at  BU  and  MD 
Anderson  Cancer  Center,  respectively.  The  study  was  uniquely  designed  by  collecting  tumor, 
normal  lung  tissue  and  airway  samples  from  cases  with  NSCLC  obtained  from  all  four 
participating  institutions  (Table  2).  Samples  from  cases  without  lung  cancer  were  collected  and 
processed  from  BU,  UCLA  and  Vanderbilt  (Table  3).  The  demographics  of  the  patients  are 
shown  in  Table  4. 

Total  RNA  from  all  samples  in  the  different  institutions  were  isolated  similarly  using  the 
miRNeasy  kit  from  Qiagen  according  to  the  manufacturer’s  instructions.  Hiseq  RNA  sequencing 
will  facilitate  the  discovery  of  novel  transcripts  in  the  molecular  field  of  injury  as  well  quantifying 
expression  of  those  that  could  not  be  characterized  by  microarray  technology.  In  addition  and  in 
contrast  to  the  Human  Gene  1 .0  ST  microarray  platform  used  for  the  field  cancerization  profiling 
described  above,  the  newer  2.0  ST  platform  to  be  utilized  will  enable  the  interrogation  of  the 
complete  set  of  known  functionally  important  long  intergenic  non-coding  RNAs  (lines)  in  the 
molecular  field  of  injury  21 .  To  date  this  is  the  first  study  that  is  1)  performing  next  generation 
sequencing  in  addition  to  microarray  profiling  analysis  of  the  molecular  filed  of  injury;  2)  applying 
multiple  high  throughput  technologies  on  field  of  injury  samples  obtained  from  four  different 
institutions  in  the  nation  and  3)  cross-comparing  the  complete  topological  map  of  the  extended 
and  localized  molecular  field  of  injury/cancerization  between  both  NSCLC  patients  and  cancer- 
free  individuals.  We  anticipate  that  RNA-Seq  and  microarray  profiling  will  be  completed  by  the 
end  of  the  year  with  subsequent  bioinformatic  and  functional  analysis  along  with  validation  of 
expression  studies  completed  by  Spring  2013.  Due  to  the  use  of  both  next  generation  RNA 
sequencing  and  comprehensive  microarray  profiling  and  due  to  this  ongoing  study’s  unique 
design  we  anticipate  that  expression  profiles  in  the  NSCLC  molecular  field  of  injury  will  harbor 
molecules,  both  novel  and  established,  that  may  exhibit  potential  for  use  as  airway  biomarkers 
that  can  be  developed  and  tested  for  lung  cancer  detection  using  minimally  invasive  sites  in 
Specific  Aim  3  of  this  award. 

Table  2.  Molecular  mapping  of  the  field  of  injury  in  NSCLC  and  cancer-free  patients 


RNA-Seq  (cases) 

Microarray  (cases) 

Institution 

ADC 

SCC 

No  Cancer 

ADC 

SCC 

No 

Cancer 

MD  Anderson 

4 

2 

0 

4 

3 

0 

BU 

2 

1 

3 

0 

2 

4 

UCLA 

2 

2 

1 

3 

2 

2 

Vanderbilt 

1 

1 

1 

4 

3 

1 

Number 

9 

6 

5 

11 

10 

7 

Total  Nb.  of  cases 
analyzed 

20 

28 

Total  Nb.  of  samples 
analyzed 

156 

183 

RNA-Seq,  RNA  sequencing;  ADC,  adenocarcinoma;  SCC,  squamous  cell  carcinoma;  BU, 
Boston  University;  UCLA,  University  of  California  Los  Angeles. 
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Table  3.  Diagnoses  of  patients  participating  in  our  study 


Number  of  Subjects 

Diagnosis 

14 

Lung  cancer 

5 

Benign 

1 

Other  cancer 

15 

Pending 

Table  4.  Demographics  of  study  participants 


Ethnicity 

Male 

Female 

White 

14 

5 

Black 

7 

3 

Hispanic 

1 

0 

Asian 

2 

1 

American  Indian 

0 

0 

Other 

1 

0 

Unknown 

0 

0 

Specific  Aim  2:  To  increase  our  understanding  of  the  role  of  tumor-initiating 

stem/progenitor  cells  in  the  pathogenesis  of  lung  cancer  in  the  “field  cancerization”  that 
develops  in  current  and  former  smokers. 

Summary  of  Research  Findings: 

A.  Feasibility  of  sequencing  small  amounts  of  RNA  from  laser  captured  samples  that 
reflect  different  pathologic  stages  of  lung  carcinogenesis  (Sub-specific  Aim  2B): 

Specific  regions  of  normal  basal  cells,  premalignant  metaplastic/dysplastic  cells,  and  squamous 
carcinoma  cells  were  successfully  selected 
by  laser  microdissection.  This  was 
described  and  detailed  in  the  previous 
annual  report  (Year  1).  Adequate  amounts 
of  RNA  were  isolated  from  these  cells  for 
library  preparation  and  high  throughput 
sequencing  (RNA-seq),  and  dec  ent 
quantities  of  libraries  with  appropriate  size 
ranges  were  generated.  This  was  described 
and  detailed  in  the  previous  annual  report 
(Year  1).  Samples  were  sequenced  on 
lllumina  Genome  Analyzer  llx  or  HiSeq 
2000  instruments,  producing  single-end 
reads  with  quality  control  Phred  scores 
above  30. 


n=2  patients 

n=2  patients 

RNA  sequencing  platform 

Genome  Analyzer  llx 
36-base,  single-end 

HiSeq  2000 
50-base,  single-end 

Bowtie 

¥ 

^  TopHat 

Alignment  method 

Bowtie 

TcpRat 

I 

Uniquely  aligned  reads/sample 

~I0-1S  million  reads 

~S0-GC  million  reads 

Y 

V 

coverageBed  (BEDTools  suite} 

Gene-level  kh  prats  Ion 

_ 1 _ 

_ I _ 

Expression  of  ~351,000 

Ensembl  Gene  (EN5G)  IDs 

Figure  21.  RNA  sequencing  workflow.  Single-end  reads 
were  aligned  to  the  human  genome  and  gene  ex  pression  was 
quantified  by  measuring  the  overlap  with  Ensembl  Gene  IDs. 
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Sequence  alignment  and  quantification  of  gene  expression 


The  reads  produced  by  sequencing  each  RNA  sample  were  aligned  to  the  human  genome 
(build  hg19).  Uniquely  aligned  reads  were  then  used  to  compute  gene  expression  estimates  by 
measuring  the  coverage  of  each  of  -35,000  Ensembl  Gene  loci  using  the  coverageBed  utility  in 
the  BEDTools  suite.  This  workflow  is  illustrated  in  Figure  21. 


Identification  of  genes  with  progression-associated  expression  patterns 


To  identify  genes  whose  expression 
was  associated  with  progression  from 
normal  to  intermediate  (metaplastic  or 
dysplastic)  to  tumor  cells,  a  t  hree-step 
procedure  was  used  as  outlined  in 
Figure  22.  Fi  rst,  genes  with  low 
expression  (median  expression  was 
below  100  reads),  which  are  more  likely 
to  be  false  positives,  were  removed 
from  analysis.  N  ext,  to  simplify  the 
analysis,  a  concordance  filter  was  then 
applied  in  order  to  consider  only  those 
genes  whose  expression  changed  in  the 
same  direction  in  both  intermediate  and 
tumor  cells  relative  to  normal  cells. 


35,413  ENSG  IDs 


Expression  filter 


Median  counts  >  100  across  all  samples 


4,776  ENSG  IDs 


Concordance  fitter 


direction  (intermediate  vs,  normal)  = 
direction  (tumor  vs.  normal) 


3,761  ENSG  IDs 
intermediate  vs.  normal 


Linear  mixed-effects  {(me) 

Treats  expression  as  continuous  variable; 
treats  patients  as  random  effect 


Negative  binomial  (edgeR) 

Treats  expression  as  discrete  variable; 
treats  patients  as  fixed  effect 


p<  0.Q5 


Figure  22.  Identifying  genes  associated  with  SCC  progression. 

Genes  were  filtered  by  median  expression  and  by  direction  of  change 
relative  to  normal  controls  before  assigning  significance  using  two 


Finally,  two  different  statistical  models  were 
used  to  identify  genes  whose  expression  was 
significantly  altered  between  intermediate  and 
normal  cells.  I  n  one  case,  a  I  inear  mixed- 
effects  approach  was  used,  which  assumes 
that  expression  is  a  continuous  measure,  and 
in  which  sample  group  and  pat  ient  were 
modeled  as  fixed  and  random  effects, 
respectively.  Separately,  a  negative  binomial 
model  was  used,  which  assumes  that 
expression  is  a  discrete  variable,  and  in  which 
both  patient  and  sample  group  were  modeled 
as  fixed  effects.  The  gene  lists  obtained  with  a 
nominal  p  value  cutoff  of  0.05  by  either 
approach  were  intersected  to  obtain  a  I  ist  of 
104  genes.  The  expression  of  these  genes  in 
all  samples  from  all  four  patients  is  illustrated 
in  Figure  23. 

B.  Data  Validation  with  Quantitative  Real- 
Time  PCR  and  I  mmunofluorescence 
Staining: 


Patient  1 


Patient  2 


Patient  3 


Patient  4 


Figure  23.  Expression  of  104  genes  associated  with 
SCC  progression.  The  raw  counts  have  been  scaled  to  a 
mean  of  zero  and  unit  variance  within  each  patient.  Blue 
and  red  indicate  expression  that  is  two  or  more  standard 
deviations  below  or  above  the  mean  expression  in  each  row, 
respectively. 


To  validate  the  data  from  gene  expression 
analysis  of  the  first  four  patients,  we  performed 
quantitative  real-time  PCR  (qPCR)  on  selected 
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candidate  genes.  From  the  RNA-seq  data,  we  calculated  the  coefficient  of  variance  of  the  reads 
per  kilobase  per  million  reads  (RPKM)  across  all  samples  for  23  g  enes  that  were  commonly 
used  as  endogenous  control  genes  in  TaqMan-based  qPCR  assays.  From  this  analysis,  we 
chose  Beta-2-microglobulin  (B2M)  as  the  endogenous  control,  as  it  has  the  lowest  coefficient  of 
variance  as  well  as  the  highest  median  read  counts.  We  used  two  of  the  sequenced  patients, 
Patients  3  and  4,  as  well  as  four  additional  independent  cases,  Patients  5-8,  for  the  qPCR 
validation  study.  We  selected  three  candidate  genes  for  validation,  CEACAM5,  SLC2A1,  and 
PTBP3,  as  all  of  them  showed  increased  expression  in  the  premalignant  lesions  and  tumors 
compared  to  normal  basal  cells.  For  each  of  these  genes,  we  examined  the  changes  of  mRNA 
levels  between  intermediate  lesions  and  the  normal  basal  cells  across  all  six  patients  (Figure 
24A).  The  mRNA  levels  of  CEACAM5  and  SLC2A1  were  validated  to  be  hi  gher  in  the 
intermediate  lesions  compared  to  normal  basal  cells  across  all  patients.  For  PTBP3,  two  of  the 
six  patients  examined  showed  lower  mRNA  levels  in  the  intermediate  lesions  than  normal  basal 
cells. 


Figure  24.  Validation  of  RNA-seq  data  at  the  mRNA  and  protein  level. 

A.  Q-PCR  with  Taqman  primer  probe  sets  showed  increased  gene  expression  of  SLC2A1  and 
CEACAM5  in  premalignant  lesions  and  tumors  as  compared  to  normal  basal  cells.  PTBP3  expression 
was  similarly  increased,  except  in  2  patients.  B-D.  Immunofluorescent  staining  for  CEACAM5, 
SLC2A1  and  PTBP3,  respectively.  All  3  proteins  were  expressed  at  higher  levels  in  premalignant 
lesions  and  NSCLCs  as  compared  to  normal  basal  cells.  Scale  bars  =  100  microns.  Blue  =  DAPI 
staining.  KRT5  =  keratin5  ,  a  basal  cell  marker. 
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We  picked  these  3  genes  for  validation  because  they  all  could  play  critical  roles  in  the  biology  of 
premalignant  lesions  and  lung  carcinogenesis.  SLC2A1  is  also  known  as  glucose  transporter  1 
(GLUT-1)  and  i  s  a  facilitative  glucose  transporter.  This  isoform  may  be  r  esponsible  for 
constitutive  or  basal  glucose  uptake  and  has  broad  substrate  specificity.  It  is  critical  in  glucose 
metabolism,  and  driving  glycolysis  in  particular.  CEACAM5,  a  cell  surface  glycoprotein  that 
plays  a  role  in  cell  adhesion  and  in  intracellular  signaling,  is  part  of  the  CEACAM  family  that  is 
important  in  other  epithelial  cell  cancers,  such  as  colon  cancer.  PTBP3  functions  as  an  RNA- 
binding  protein  that  mediates  pre-mRNA  alternative  splicing  regulation.  It  plays  a  role  in  the 
regulation  of  cell  proliferation,  differentiation  and  migration  and  is  a  positive  regulator  of 
erythropoietin-dependent  erythropoiesis. 

It  has  been  reported  that  the  correlation  between  mRNA  levels  and  protein  levels  is  not  always 
straightforward.22"24  To  further  investigate  if  the  changes  in  transcript  levels  in  the  intermediate 
lesions  for  the  selected  genes  do  c  arry  on  to  the  protein  levels,  we  performed  tissue 
immunofluorescence  staining  of  SLC2A1 ,  CEACAM5,  and  P  TBP3  on  t  issues  with  normal 
epithelium,  premalignant  lesions,  and  carcinomas  from  two  independent  cases  (Patients  9  &  10) 
(Figure  24B-D).  Since  the  PTBP3  data  were  not  validated  in  Patient  6  and  P  atient  8  with 
qPCR,  additional  immunofluorescence  staining  of  PTBP3  was  performed  on  these  two  patients 
to  examine  if  the  data  could  be  validated  at  the  protein  level.  We  found  that  CEACAM5  and 
SLC2A1  were  not  expressed  in  the  normal  epithelium,  but  were  highly  expressed  in 
premalignant  lesions.  SLC2A1  was  also  highly  expressed  in  the  tumor,  erefore  validating  the 
RNA-seq  data  at  the  protein  level.  Similarly,  we  found  that  CEACAM5  was  expressed  at  a  low 
level  in  the  normal  airway  basal  cells,  had  i  ncreased  protein  expression  in  the  premalignant 
lesions  and  higher  expression  in  tumors.  Unlike  SLC2A1 ,  CEACAM5  was  expressed  in  some 
but  not  all  tumor  cells.  PTBP3  was  expressed  in  the  columnar  cells  of  normal  epithelium  but 
showed  low  to  no  expression  in  the  basal  cells.  PTBP3  expression  was  also  detected  in 
premalignant  lesions  and  in  tumor  cells  and  some  mesenchymal  cells  surrounding  the  tumor. 


We  also  examined  the  relationship  between  smoking 
history  and  the  expression  of  SLC2A1  and  CEACAM5 
in  the  airway  epithelium  in  a  previous  gene  expression 
profiling  study  of  bronchoscopic  brushings  (Figure  25) 
[Spira  et  al  PNAS  2004].  The  expression  of  both  genes 
was  strongly  increased  in  brushings  from  current 
smokers  compared  with  those  from  never  smokers 
(p=8.5x10"8  and  2.2x1 0"14,  respectively).  However,  the 
two  genes  differed  in  their  reversibility  after  smoking 
cessation.  SLC2A1  expression  was  not  significantly 
different  between  former  and  never  smokers  (p=0.20), 
but  was  significantly  different  between  current  and 
former  smokers  (p=0.012).  Conversely,  CEACAM5 
expression  was  not  significantly  different  between 
current  and  former  smokers  (p=0.061),  but  was  still 
significantly  induced  in  former  smokers  relative  to  never 
smokers  (p=9.9x10  5).  These  data  indicate  that  SLC2A1 
and  CEACAM5  are  reversibly  and  irreversibly  induced 
by  smoking,  respectively. 


In  this  study,  we  have  used  laser  microdissection  to  obtain  materials  from  the  tumors, 
premalignant  lesions,  and  normal  cells  of  the  same  patient,  which  offers  a  m  ore  accurate 
depiction  of  differential  gene  expression  than  in  previously  published  lung  cancer  studies  of 


SLC2A1 


CEACAM  5 


Figure  25.  Expression  of  SLC2A1  and 
CEACAM5  expression  in  the  bronchial 
epithelium  is  associated  with  smoking 
status.  Gene  expression  was  profiled  by 
microarray  in  bronchial  brushings  from 
never  (N),  former  (F)  and  current  (C) 
smokers.  [Spira  et  al  PNAS  2004]  *  p  < 
0.05  by  Student’s  t  test;  ***  p  <  0.001 . 


21 


undissected  lung  tissue  sections.25"27  These  data  are  currently  the  subject  of  a  manuscript  in 
preparation. 

C.  Identification  of  additional  archived  clinical  specimens  for  laser  microdissection  of 
tumor-initiating  stem/progenitor  cells: 

Additional  archived  clinical  specimens  from  which  we  previously  extracted  DNA  and  RNA  from 
rare  target  cells  using  laser  microdissection  were  identified.  Within  the  same  individual,  the 
following  regions/cells  were  present:  normal  epithelium,  normal  basal  cells,  dysplastic  cells  in 
preinvasive  tissue,  as  well  as  their  respective  basal  cells,  and  invasive  tumor  cells.  The  cases 
identified  included  biopsy  specimens  and  resected  tumors,  and  each  was  confirmed  to  be 
amenable  for  microdissection  after  review  of  freshly  cut  H&E  stained  slides  by  our  collaborating 
pathologists.  Using  archived  clinical  specimens  available  in  biorepositories  at  each  site,  we 
have  sufficient  material  to  complete  the  studies  described  in  sub-specific  Aim  2B. 

D.  Feasibility  of  performing  shotgun  proteomics  on  rare  target  cells  and  arriving  at  a 
robust  protein  signature  (Sub-specific  Aim  2B): 

As  described  in  the  previous  annual  report  (Year  1),  we  have  identified  a  molecular  profile 
dominated  by  the  Snail  transcription  factor  that  appears  to  drive  epithelial  mesenchymal 
transition  (EMT)  and  tumor-initiating  characteristics  in  the  airway  epithelium,  as  modeled  in  vitro 
and  in  vivo.  Snail  is  also  over-expressed  in  human  bronchial  epithelial  cells  in  premalignant 
lesions  in  situ  concomitant  with  markers  of  EMT  and  sternness.  To  validate  the  technology  for 
analyzing  tumor-initiating  stem  cells  from  in  situ  specimens,  we  performed  preliminary  Shotgun 
Proteomic  Analysis  comparing  in  vitro  human  bronchial  epithelial  cells  and  t  he  same  cells 
ectopically  over-expressing  Snail.  Cell  pellets  were  collected,  prepared  for  LC-MS/MS  shotgun 
proteomics,  and  analyzed,  as  described  in  detail  in  our  year  1  progress  report.  Briefly,  each  0.2 
mg  protein  aliquot  was  digested  and  resolved  by  isoelectric  focusing  into  15  fractions  that  were 
subsequently  analyzed  by  LC-MS/MS.  Thus,  there  were  6  measurements  (2  technical  replicates 
for  3  samples)  for  the  control  group  and  6  for  the  Snail+  group.  Raw  MS/MS  data  were 
evaluated  using  MyriMatch  and  IDPicker  software.  Differentially  expressed  proteins  were  then 
identified  using  Quasi-Tel  pair  wise  comparison.  The  initial  dataset  was  robust,  with  2809 
protein  groups  identified  overall;  a  protein  group  usually  represents  a  single  protein,  but  it  is 
occasionally  a  s  mall  group  of  indistinguishable  proteins  with  identical  peptides.  The  overall 
numbers  of  protein  groups  in  the  control  and  Snail+  bronchial  epithelial  cells  were  similar  (2229 
and  2738,  respectively).  The  following  general  observations  were  made:  (1)  Known  markers  of 
EMT  were  over-expressed  in  the  Snail+  cells.  (2)  Other  structural/motility  proteins  consistent 
with  an  EMT  phenotype  were  also  over-expressed  in  the  Snail+  cells. 

To  augment  our  ability  to  identify  proteins  relevant  to  the  molecular  pathogenesis  of  lung  cancer 
across  the  broadest  possible  patient  population,  we  will  perform  shotgun  proteomics  on 
additional  samples.  A  panel  of  bronchial  epithelial  cells  isolated  from  patients  and  engineered  to 
over-express  Snail  have  been  plated  in  western  blot  and  anchorage  independent  growth  (AIG) 
assays.  Via  these  assays,  we  will  re-confirm  their  Snail  expression  and  Snail-driven  malignant 
conversion  prior  to  their  proteomic  evaluation.  Each  of  the  cell  types  in  this  panel  has  previously 
demonstrated  numerous  Snail-driven  cancer-associated  phenotypes,  including  EMT,  sternness, 
AIG,  and/or  tumor  growth  and  metastatic  behavior  in  mice.  At  the  conclusion  of  these  assays, 
cells  maintained  in  culture  in  parallel  will  be  collected,  prepared  for  shotgun  proteomics,  and 
analyzed,  as  previously  described.  Additionally,  tumor-initiating  Snail+ALDH+CD44+CD24- 
bronchial  epithelial  cells  will  be  subjected  to  shotgun  proteomics  in  the  same  manner.  This 
relatively  rare  cell  type  will  be  isolated  by  fluorescence  activated  cell  sorting,  and  the  resulting 
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cell  pellets  will  be  frozen.  Multiple  pellets  will  be  pool  ed  to  generate  material  sufficient  for 
evaluation;  this  again  models  laser  microdissection  of  rare  stem/progenitor  target  cells  from 
archived  clinical  specimens. 

By  evaluating  additional  samples,  including  rare  tumor-initiating  stem  cells,  we  anticipate 
arriving  at  a  more  robust  protein  signature  relevant  to  lung  carcinogenesis.  Models  and  software 
developed  in  the  Jim  Ayers  Institute  at  Vanderbilt,  in  collaboration  with  Dr.  Dan  Liebler,  are 
more  appropriately  applied  to  studies  with  these  multiple  inputs.  The  new  protein  signature  that 
emerges  will  be  further  strengthened  via  Multiple  Reaction  Monitoring  (MRM)  performed  on  the 
remaining  samples.  MRM  using  mass  spectrometry  is  a  highly  sensitive  and  selective  method 
for  the  targeted  quantitation  of  protein  or  peptide  abundances  in  complex  samples.  While 
shotgun  proteomics  detects  all  protein  changes  in  the  sample  in  an  unfocused  fashion,  MRM  is 
targeted  and  highly  selective,  allowing  us  to  specifically  look  for  proteins  of  interest.  Dr.  Liebler 
has  developed  and  refined  a  series  of  MRM  assays  that  can  be  applied  to  the  airways  of 
individuals  for  lung  cancer.  The  relevant  method  developments  were  recently  published  in  MCP 
28.  To  this  end,  we  have  generated  a  I  ist  of  candidate  proteins  for  MRM  utilizing  shotgun 
proteomic,  mRNA  array,  and  m  iRNA  array  datasets  generated  from  the  same  Snail+  cells. 
Candidates  with  the  greatest  fold  change  and  level  of  significance  were  included.  Candidates  at 
the  intersection  of  each  of  these  lists,  as  evaluated  by  Ingenuity  Pathway  Analysis  (IPA),  were 
also  included.  Finally,  additional  candidates  of  interest  were  included  based  on  our  hypothesis- 
driven  studies  of  lung  carcinogenesis,  including  mediators  of  inflammation,  EMT,  sternness, 
metabolism,  apoptosis-resistance,  as  evaluated  in  the  Pis’  lab-based  studies  over  the  past 
several  years.  While  we  have  already  generated  this  candidate  list  for  MRM,  the  list  will  be 
further  refined  as  we  expand  our  shotgun  proteomic  analysis  to  include  additional  samples. 

Finally,  during  the  preceding  funding  period,  we  developed  a  Microsoft  Access  database  with 
the  intent  of  including  an  additional  parameter,  “druggability”,  in  the  selection  of  top  candidates 
for  further  validation  and  det  ailed  functional  studies.  The  first  iteration  of  the  database  was 
created  by  integrating  the  protein,  mRNA,  and  miRNA  datasets  previously  described  with  a  lung 
cancer-specific  terms  list.  These  were  then  linked  to  information  regarding 
proteins/genes/miRNAs  for  which  agents  are  in  use  or  in  the  pipeline  along  with  additional 
clinical  utility  parameters,  such  as  how  successful  the  agent  is  and  its  range  of  use.  This 
database  will  be  refined  as  we  expand  our  shotgun  proteomic  analysis  to  additional  samples 
and  as  we  receive  inputs  from  the  sequencing  and  array  studies  in  the  other  aims.  This 
database  will  serve  as  an  important  new  tool  for  selecting  the  best  protein  candidates  to  include 
in  our  upcoming  MRM  studies. 

Specific  Aim  3:  Test  airway-based  mRNA  and  microRNA  biomarkers  of  diagnosing  lung 
cancer  in  current  and  former  smokers  at  high  risk  for  lung  cancer  in  minimally  invasive 
sites. 

Summary  of  Research  Findings: 

The  studies  on  this  Aim  will  be  carried  out  in  Years  3  and  4  of  the  grant. 

KEY  RESEARCH  ACCOMPLISHMENTS 

•  Identified  aberrant  activation  of  canonical  oncogenes  in  the  molecular  field  of  injury  of  early 
stage  NSCLC  patients  including  phosphorylated  AKT  and  ERK  kinases. 

•  Completed  the  analysis  and  characterization  of  the  molecular  localized  field  cancerization 
using  194  samples  comprised  of  tumor,  normal  lung  and  airway  samples  from  20  NSCLC 
cases. 
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•  Derived  field  cancerization  expression  signatures,  comprised  of  genes  concordantly  and 
significantly  differentially  expressed  between  tumors  and  ai  rways  compared  to  matched 
normal  lung  tissue,  and  pertinent  to  NSCLC,  lung  adenocarcinomas  and  SCCs. 

•  In  collaboration  with  BU  (Partnering  institution,  demonstrated  that  molecular  profiles  in  the 
localized  field  cancerization  are,  at  least  in  part,  relevant  to  the  molecular  field  of  injury  and 
contain  markers  significantly  and  concordantly  different  between  airways  of  patients  with 
and  without  lung  cancer 

•  Identified  profiles  that  were  significantly  progressively  and  di  fferentially  expressed  by 
distance  from  corresponding  lung  tumors  and  concordantly  modulated  between  tumors  and 
paired  uninvolved  normal  lung  tissues,  pinpointing  to  their  probable  roles  in  pathogenesis. 

•  Confirmed  the  differential  expression  of  several  site-dependent  and  -independent  markers 
identified  by  microarray  analysis  of  the  field  cancerization  at  the  RNA  and  protein  level  by 
QRTPCR  and  immunohistochemical  analysis,  respectively. 

•  Identified  previously  unknown  promalignant  roles  for  LAPTM4B,  one  of  the  top  5  site- 
dependent  field  cancerization  markers  we  had  found,  in  NSCLC  evidenced  by  reduced 
clonogenic  growth  of  cells  with  knockdown  of  the  gene. 

•  Began  to  analyze  by  RNA-Seq  (20  cases,  156  samples)  and  comprehensive  microarray 
profiling  (28  cases,  183  samples)  the  molecular  field  of  injury  in  NSCLC  patients  and  cancer- 
free  individuals. 

•  Performed  RNA  sequencing  on  c  ell  populations  in  matched  sets  of  histologically  normal 
airway,  premalignant  lesions  and  tumors  from  the  same  individuals,  and  identified  candidate 
genes  that  increase  in  expression  in  premalignancy  and  in  tumors. 

•  Identified  and  validated  candidate  genes  and  their  proteins  that  are  likely  to  play  a  role  in  the 
biology  of  premalignancy  and  lung  carcinogenesis  including  SLC2A1  (glucose  transporter), 
CEACAM5  (cell  adhesion  molecule)  and  PTBP3  (regulator  of  cell  proliferation,  differentiation 
and  migration). 

REPORTABLE  OUTCOMES 


Abstracts: 

•  Kadara  H,  Fujimoto  J,  Yoo  SY,  Garcia  M,  Kabbout  M,  Basey  A,  Wang  J,  Coombes  KR,  Kim 
ES,  Hong  WK,  Kalhor  N,  Moran  C,  Wistuba  II.  Gene  expression  profiling  of  lung  tumors  and 
matched  normal  airways  reveals  common  and  disparate  aberrant  pathways  in  squamous 
cell  carcinoma  and  adenocarcinoma  development  and  potential  targets  for  chemoprevention 
in  early  stage  lung  cancer  patients.  Proceedings  of  the  103rd  Annual  Meeting  of  the 
American  Association  for  Cancer  Research;  2012  Mar  31-  Apr  4;  Chicago,  Illinois. 
Philadelphia  (PA):  AACR;  2012.  Abstract  #1721 . 

•  Ooi  AT,  Gower  AC,  Zhang  KX,  Vick  J,  Caballero  N,  Massion  PP,  Wistuba  II,  Walser  TC, 
Dubinett  SM,  Pellegrini  M,  Lenburg  ME,  Spira  A  and  Gomperts  BN.  Molecular  Profiles  to 
Improve  our  Understanding  of  Lung  Cancer  Pathogenesis  in  U.S.  Veterans.  NIH  Lung 
Cancer  SPORE  Meeting.  Pittsburgh.  July  2012. 

Manuscripts: 

•  Kadara  H,  Wistuba  II.  Field  cancerization  in  NSCLC:  implications  in  disease  pathogenesis. 
Proceedings  of  the  American  Thoracic  Society,  9(2):38-42  2012. 

•  Fujimoto  J*,  Kadara  H*  (*  equal  contributing  first  authors),  Garcia  MM,  Kabbout  M,  Behrens 
C,  Liu  DD,  Lee  JJ,  Solis  LM,  Kim  ES,  Kalhor  N,  Moran  C,  Shalafkhaneh  A,  Lotan  R,  Wistuba 
II.  G-protein  coupled  receptor  family  C  group  5  member  A  (GPRC5A)  expression  is 
decreased  in  the  adjacent  field  and  normal  bronchial  epithelia  of  patients  with  chronic 
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obstructive  pulmonary  disease  and  non  -small  cell  lung  cancer.  Journal  of  Thoracic 
Oncology.  In  Press. 

•  Kadara  H,  Shen  L,  Fujimoto  J,  Saintigny  P,  Chow  CW,  Lang  W,  Chu  Z,  Garcia  M,  Kabbout 
M,  Fan  YH,  Behrens  C,  Liu  D,  Mao  L,  Lee  J  J,  Gold  KA,  Wang  J,  Coombes  K,  Kim  ES,  Hong 
WK,  Wistuba  II.  Characterizing  the  molecular  spatial  and  temporal  field  of  injury  in  early 
stage  smoker  non-small  cell  lung  cancer  patients  after  definitive  surgery  by  expression 
profiling.  Cancer  Prevention  Research.  In  Press. 

CONCLUSIONS 


During  our  second  year  of  research,  we  derived  expression  profiles  signifying  the  molecular  field 
cancerization  in  early-stage  NSCLC.  We  also  showed  that  the  field  cancerization  profiles, 
identified  in  distal  airways  of  resected  specimens,  are,  at  least  in  part,  relevant  to  the  extended 
field  of  injury  and  differentially  expressed  between  large  airways  of  patients  with  and  without 
lung  cancer.  In  addition,  our  profiling  studies  revealed  expression  patterns  in  bronchial  epithelia 
that  are  significantly  progressively  modulated  with  distance  from  corresponding  tumors  and, 
importantly,  concordantly  modulated  between  tumors  and  pai  red  uninvolved  normal  lung 
tissues,  pinpointing  their  probable  roles  in  pathogenesis.  In  addition,  based  on  functional 
pathways  analysis  and  statistical  threshold  criteria,  we  validated  the  differentially  expression  of 
several  key  site-dependent  and-independent  field  cancerization  markers.  Furthermore,  we 
found  that,  LAPTM4B,  a  putative  oncogene  with  no  known  role  in  lung  carcinogenesis  and  one 
of  top  5  s  ite-dependent  field  cancerization  markers  that  increased  with  shorter  distance  from 
tumors  in  our  profiling  studies,  was  a  positive  mediator  of  the  malignant  phenotype.  We  also 
used  a  unique  approach  to  profile  cell  populations  from  the  normal  airway,  premalignant  lesions 
and  tumors  and  were  able  to  validate  these  genes.  We  identified  SLC2A1,  CEACAM5  and 
PTBP3  as  genes  that  are  upregulated  in  premalignancy  and  squamous  lung  cancer.  In 
particular,  the  glucose  transporter  SLC2A1  represents  a  novel  potential  target  for  preventing  or 
treating  premalignant  lesions  and,  potentially,  lung  cancer.  Current  efforts  are  underway  to 
profile  and  perform  RNA-Seq  on  the  complete  topological  map  of  the  field  of  injury  in  both 
NSCLC  patients  and  cancer-free  individuals,  which  is  expected  to  yield  airway  biomarkers  for 
lung  cancer  to  be  tested  in  future  aims  of  the  study. 
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Gene  expression  profiling  of  lung  tumors  and  matched  normal  airways  reveals  common  and 
disparate  aberrant  pathways  in  SCC  and  adenocarcinoma  development  and  potential  targets  for 
chemoprevention  in  early  stage  NSCLC  patients. 

Humam  Kadara1,  Junya  Fujimoto1,  Suk-Young  Woo3,  Melinda  Garcia1,  Mohamed  Kabbout1,  Annette 
Basey2,  Jing  Wang3,  Kevin  R.  Coombes3,  Neda  Kalhor2,  Cesar  Moran2  and  Ignacio  I.  Wistuba1,2. 

Departments  of  lrThoracic/Head  and  Neck  Medical  Oncology,  2Pathology  and  3Bioinformatics,  The 
University  of  Texas  MD  Anderson  Cancer  Center,  Houston,  Texas,  USA. 


Chemoprevention  of  non-small  cell  lung  cancer  (NSCLC)  has  been  unsuccessful  in  part  due  to  our  limited 
knowledge  of  its  pathogenesis.  It  has  been  suggested  that  normal  airway  epithelia  share  molecular 
abnormalities  with  tumors  and  may  serve  as  progenitors  for  lung  malignancies.  We  sought  to  analyze 
molecular  profiles  of  lung  adenocarcinomas  and  squamous  cell  carcinomas  (SCCs),  two  major  subtypes 
of  NSCLC,  and  matched  normal  airways  to  elucidate  aberrant  expression  patterns  in  early  phases  of  lung 
tumorigenesis.  All  specimens  were  obtained  from  primary  early  stage  NSCLC  consented  patients  who  did 
not  receive  neoadjuvant  therapy  (n=20).  We  profiled  RNA  isolated  from  tumors  and  normal  lung  as  well 
as  from  brushings  of  multiple  airways  adjacent  to  NSCLC  that  were  histopathologically  confirmed  to  lack 
neoplastic  or  preneoplastic  cells  (n=194).  Expression  signatures  signifying  genes  that  were  significantly 
and  concurrently  differentially  expressed  between  both  tumors  and  airways  compared  to  normal  lung 
tissue  (tumor-airway-normal/TAN  signatures)  were  then  derived  independently  for  SCC  and  smoker 
adenocarcinoma  cases  and  were  comprised  of  1,803  and  1,938  genes,  respectively.  The  TAN  signatures 
effectively  clustered  tumor  and  airways  from  matched  normal  lung  samples  (p<0.001).  Further  analysis 
showed  that  a  subset  of  the  genes  separated  SCC-  and  adenocarcinoma-adjacent  airways.  Moreover, 
pathways  and  gene-network  analysis  using  Ingenuity  pathways  software  highlighted  similarities  and 
differences  in  pathway  modulation  between  airway  epithelial  fields  of  SCCs  and  adenocarcinomas. 
Embryonic  stem  cell  and  eicosanoid  signaling  pathways  were  most  significantly  modulated  among  those 
common  to  both  TAN  signatures  (p<0.001).  Importantly,  retinoic  acid  receptor  and  stem  cell  signaling 
pathways  mediated  by  NANOG  and  lineage-specific  oncogene,  SOX2,  were  most  significantly  modulated 
in  the  SCC  TAN  signature,  whereas  NF-kB  and  PTEN  signaling  pathways  were  most  prevalent  in  the 
adenocarcinoma  TAN  airway  signature  (all  p<0.001).  Gene  networks  mediated  by  decreased  expression 
of  lineage-specific  oncogene,  NKX2-1/TITF-1,  and  increased  expression  of  TP63  and  networks  mediated 
by  increased  expression  of  the  MET  and  ERBB2  oncogenes  were  predominantly  functionally  modulated 
in  the  SCC  and  adenocarcinoma  TAN  signatures,  respectively.  Quantitative  PCR  analysis  confirmed  the 
up-regulation  of  the  MET  oncogene  in  adenocarcinomas  and  normal  airways  compared  to  normal  lung. 
These  findings  highlight  expression  patterns  and  pathways  that  are  deregulated  differentially  in  the 
pathogenesis  of  lung  adenocarcinomas  and  SCCs  and  therefore  offer  therapeutic  targets  to  guide 
personalized  chemoprevention  in  early  stage  NSCLC  patients.  Supported  by  DoD  W81XWH-10-1-1007. 
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Field  Cancerization  in  Non-Small  Cell  Lung  Cancer 

Implications  in  Disease  Pathogenesis 
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Lung  cancer,  of  which  non-small  cell  lung  cancer  (NSCLC)  composes 
the  majority,  is  the  leading  cause  of  cancer-related  deaths  in  the 
United  States  and  worldwide.  NSCLCs  are  tumors  with  complex  bi¬ 
ology  that  we  have  recently  started  to  understand  with  the  advent 
of  various  histological,  transcriptomic,  genomic,  and  proteomic 
technologies.  However,  the  histological  and  molecular  pathogene¬ 
sis  of  this  malignancy,  in  particular  of  adenocarcinomas,  is  still 
largely  unknown.  Earlier  studies  have  highlighted  a  field  canceriza¬ 
tion  phenomenon  in  which  histologically  normal-appearing  tissue 
adjacent  to  neoplastic  and  pre-neoplastic  lesions  display  molecular 
abnormalities,  some  of  which  are  in  common  with  those  in  the 
tumors.  This  review  will  summarize  advances  in  understanding  the 
field  cancerization  phenomenon  and  the  potential  relevance  of  this 
knowledge  to  gain  important  and  novel  insights  into  the  molecular 
pathogenesis  of  NSCLC  as  well  as  to  subsequent  development  of 
biomarkers  for  early  detection  of  lung  cancers  and  possibly  person¬ 
alized  prevention. 

Keywords:  lung  cancer;  field  cancerization;  pathogenesis;  airway  epi¬ 
thelium 

Lung  cancer  is  the  leading  cause  of  cancer  deaths  in  the  United 
States  and  worldwide  in  both  developing  and  developed  regions 
(1).  The  high  mortality  of  this  disease  is  in  part  due  to  our 
lacking  knowledge  of  the  molecular  mechanisms  governing  lung 
cancer  pathogenesis  as  well  as  the  late  diagnosis  of  the  majority 
of  lung  cancers  after  regional  or  distant  spread  of  the  malig¬ 
nancy  (2).  Non-small  cell  lung  cancer  (NSCLC)  represents  the 
majority  of  diagnosed  lung  cancers  (2)  and  is  mainly  composed 
of  squamous  cell  carcinomas  (SCCs)  and  lung  adenocarcinomas 
(2,  3).  Several  major  differences  exist  between  adenocarcinomas 
and  SCCs.  For  example,  compared  with  SCCs  that  arise  from 
the  major  bronchi  and  are  mainly  centrally  located,  pulmonary 
adenocarcinomas  arise  from  small  bronchi,  bronchioles,  or  al¬ 
veolar  epithelial  cells  and  are  typically  peripherally  located,  as 
reviewed  elsewhere  (2-5).  Moreover,  whereas  SCC  pathogenesis 
is  strongly  linked  to  smoking,  lung  adenocarcinoma  is  the  more 
common  histological  subtype  in  never-smoker  patients  (6-9).  Al¬ 
though  the  sequence  of  lesions  in  the  pathogenesis  leading  to 
SCCs  is  well  described,  little  is  known  about  the  sequential  de¬ 
velopment  of  adenocarcinomas.  Moreover,  we  are  still  lacking  in 
our  knowledge  of  differential  mechanisms  of  molecular  patho¬ 
genesis  among  both  subtypes  of  NSCLC. 
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In  light  of  the  postulated  period  of  time  and  multiple  stages  re¬ 
quired  for  the  development  of  overt  epithelial  tumors,  it  is  plau¬ 
sible  to  assume  that  early  diagnosis  of  lung  cancer  or  intraepithelial 
lesions  coupled  with  effective  prevention  strategies  will  reduce  the 
significant  health  burden  associated  with  this  disease  (10).  Despite 
recent  encouraging  findings  from  the  National  Lung  Screening 
Trial  (NLST)  (11),  early  detection  and  prevention  of  lung  cancer 
is  challenging  due  to  the  lack  of  biomarkers  for  early  diagnosis  of 
the  disease  and  to  the  presence  of  multiple  neoplastic  molecular 
pathways  that  mediate  lung  carcinogenesis.  Earlier  studies  have 
highlighted  a  field  cancerization  phenomenon  in  which  histolog¬ 
ically  normal-appearing  tissue  adjacent  to  neoplastic  and  pre¬ 
neoplastic  lesions  displays  molecular  abnormalities,  some  of  which 
are  in  common  with  those  in  the  tumors.  It  is  plausible  to  assume 
that  understanding  early  events  in  tumor  development  that  com¬ 
mence  in  histologically  normal  epithelium  would  pave  the  way  for 
unmet  effective  and  personalized  strategies  for  lung  cancer  pre¬ 
vention  and  treatment.  This  review  mainly  summarizes  advances 
in  understanding  the  field  cancerization  phenomenon  and  the  po¬ 
tential  relevance  of  this  knowledge  to  gain  important  and  novel 
insights  into  the  molecular  pathogenesis  of  NSCLC  as  well  as  to 
subsequent  development  of  biomarkers  for  early  detection  of  lung 
cancers  and  possibly  personalized  prevention. 

NSCLC  PRE-NEOPLASIA  AND 
MOLECULAR  PATHOGENESIS 

From  biological  and  histopathological  perspectives,  NSCLC  is 
a  complex  malignancy  that  develops  through  multiple  pre-neoplastic 
pathways.  Lung  adenocarcinoma,  a  major  subtype  of  NSCLC,  has 
been  increasing  in  incidence  globally  in  both  smokers  and  non- 
smokers  (9),  with  a  concurrent  decrease  in  SCC  frequency.  It 
has  been  suggested  that  the  increasing  incidence  of  lung  adeno¬ 
carcinomas  compared  with  SCCs  is  in  part  due  to  the  change  in 
the  type  of  cigarettes  used  (lower  nicotine  and  tar)  and  smoking 
habits  and  behavior  (7).  Anatomical  differences  in  the  location  of 
diagnosed  lung  adenocarcinomas  and  SCCs  strongly  suggest  that 
both  NSCLC  subtypes  develop  through  different  histopathologi¬ 
cal  and  molecular  pathways  and  have  different  cells  of  origin; 
however,  the  specific  respiratory  epithelial  cell  type  from  which 
each  lung  cancer  type  develops  has  not  been  established  with 
certainty  (3).  However,  it  is  noteworthy  that  Clara  cells  and  the 
type  II  pneumocytes  are  believed  to  be  the  progenitor  cells  of  the 
peripheral  airways,  and  peripherally  arising  adenocarcinomas  of¬ 
ten  express  markers  of  these  cell  types  (12,  13). 

The  multistage  stepwise  fashion  of  tumor  development  has  been 
demonstrated  in  various  anatomical  organs  exemplified  by  the  op¬ 
erational  stages  occurring  during  skin  carcinogenesis  (14).  Carcino¬ 
genesis  of  the  skin  is  initiated  by  a  carcinogen-induced  mutational 
event,  promoted  by  clonal  outgrowth,  which  may  be  dependent  on 
tumor  promoters,  followed  by  progression  of  premalignant  lesions 
(e.g.,  papillomas)  and  their  conversion  to  malignant  tumors  (14). 
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and  7)  are  associated  with  premalignant  progression  of  mouse  skin 
papillomas  (14-17).  The  sequence  of  histopathological  and  molec¬ 
ular  changes  in  bronchial  epithelia  that  precede  the  development 
of  lung  SCCs  have  been  characterized,  demonstrating  that  sequen¬ 
tially  occurring  allele-specific  molecular  changes  commence  in  dis¬ 
persed  foci  signifying  a  multistage  fashion  of  squamous  lung 
cancer  pathogenesis  (4,  18).  At  least  in  this  subtype  of  NSCLC, 
it  has  been  shown  that  specific  genomic  alterations  (mainly  3p21, 
3p22-24,  3p25,  and  9p21)  occur  in  histologically  normal  bronchial 
epithelia  from  resected  specimens  (18).  Moreover,  and  notably, 
these  alterations  persisted  in  hyperplasias,  dysplasias,  carcinomas 
in  situ ,  and  tumors  that  exhibited  different  commencing  genomic 
aberrations  (18).  In  addition,  loss  of  heterozygosity  (LOH)  in 
the  3p  region  was  demonstrated  in  normal  bronchial  epithelia  of 
cancer-free  smokers,  further  highlighting  the  early  role  this  specific 
genomic  alteration  exerts  in  lung  cancer  pathogenesis  (19,  20). 
It  is  noteworthy  that  McCaughan  and  colleagues  demon¬ 
strated  that  no  low-grade  lesions,  but  all  high-grade  lesions, 
exhibited  3q  amplification  targeting  the  sex  determining  re¬ 
gion  Y-box  2  ( SOX2 )  lineage-specific  oncogene  (21).  These 
previous  findings  and  reports,  through  highlighting  associations 
between  histopathological  sequences  and  specific  molecular 
aberrations,  pinpoint  to  a  multistage  and  multistep  manner  of 
lung  carcinogenesis. 

On  the  other  hand,  one  cannot  neglect  the  alternative  hypoth¬ 
esis  that  the  sequence  of  genetic  and  epigenetic  alterations  is 
irrelevant  to  lung  cancer  pathogenesis,  but  rather  the  accumula¬ 
tion  of  molecular  abnormalities  beyond  a  certain  threshold  medi¬ 
ates  development  of  the  malignant  phenotype.  It  has  been 
suggested  that  at  least  two  molecular  pathways,  the  Kirsten  rat 
sarcoma  viral  oncogene  (KRAS)  and  epidermal  growth  factor 
receptor  ( EGFR )  pathways,  are  involved  in  the  development 
of  smoker  and  never-smoker  adenocarcinomas,  respectively  (2, 
9).  Moreover,  and  as  reviewed  by  Yatabe  and  colleagues,  atyp¬ 
ical  adenomatous  hyperplasias  (AAHs),  which  are  considered  to 
be  precursor  lesions  for  peripheral  lung  adenocarcinomas  (3,  5) 
and  the  only  sequence  of  morphologic  change  identified  so  far 
for  the  development  of  invasive  lung  adenocarcinomas,  exhibit 
KRAS  mutations  more  frequently  than  invasive  adenocarcino¬ 
mas  (5).  Conversely,  our  group  has  previously  demonstrated  that 
EGFR  mutations  commence  in  histologically  normal  bronchial 
epithelia  adjacent  to  lung  adenocarcinomas  and  precede  copy 
number  increase  of  the  oncogene  (22,  23).  EGFR  mutations  also 
are  persistent  throughout  the  different  phases  of  lung  adenocar¬ 
cinoma  development  (5),  which  harbor  different  genomic  alter¬ 
ations  (24).  It  is  plausible  to  surmise  that  only  after  increasing 
our  knowledge  of  the  pre-neoplastic  changes  as  well  as  the  cor¬ 
responding  molecular  abnormalities  leading  to  the  development 
of  lung  adenocarcinomas  would  we  then  be  able  to  more  confi¬ 
dently  determine  whether  adenocarcinomas  follow  a  linear  pro¬ 
gression  mechanism  or  not  (5).  However,  and  based  on  the 
aforementioned  previously  reported  observations  by  our  group 
and  others,  we  believe  that  it  is  not  counterintuitive  to  speculate 
that  development  of  lung  malignant  phenotype,  including  that  of 
adenocarcinomas,  is  due  to  stepwise,  sequence-specific,  and  mul¬ 
tistage  molecular  pathogenesis  as  well  as  accumulation  and  com¬ 
bination  of  genetic  and  epigenetic  abnormalities. 

FIELD  CANCERIZATION 

Smoking-Damaged  Epithelium  and  the  Field 
Cancerization  Phenomenon 

Although  the  majority  of  lung  cancer  patients  are  current  or  for¬ 
mer  smokers,  a  relatively  small  proportion  of  these  smokers 


mary  tumor  development  after  definitive  treatment  by  resective 
surgery.  There  is  a  large  body  of  evidence  that  heavy  smokers 
and  patients  who  have  survived  lung  cancer  compose  a  high- 
risk  population  that  may  be  targeted  for  early  detection  and  che- 
moprevention  efforts  (4).  Although  the  risk  of  developing  lung 
cancer  decreases  after  smoking  cessation,  the  risk  never  returns 
to  baseline.  Pre-neoplastic  changes  have  been  used  as  surrogate 
endpoints  for  chemopreventive  studies.  However,  it  was  sug¬ 
gested  that  this  “shooting  in  the  dark”  approach  may  explain 
the  reasons  behind  the  general  failures  of  clinical  chemopreven- 
tion  studies  (10).  Therefore,  novel  approaches  to  identify  the  best 
population  to  be  targeted  for  early  detection  and  chemopreven- 
tion  should  be  devised,  and  risk  factors  for  lung  cancer  develop¬ 
ment  or  relapse  need  to  be  better  defined.  For  these  important 
purposes,  a  better  understanding  of  the  biology  and  molecular 
origins  of  lung  cancer  is  warranted. 

Earlier  work  by  Danely  Slaughter  in  patients  with  oral  cancer 
and  oral  premalignant  lesions  has  suggested  that  histologically 
normal-appearing  tissue  adjacent  to  neoplastic  and  pre-neoplastic 
lesions  display  molecular  abnormalities,  some  of  which  are  in 
common  with  those  in  the  tumors  (25).  In  1961,  a  seminal  report 
by  Auerbach  and  colleagues  suggested  that  cigarette  smoke 
induces  extensive  histological  changes  in  the  bronchial  epithelia 
in  the  lungs  of  smokers  and  that  premalignant  lesions  are  wide¬ 
spread  and  multifocal  throughout  the  respiratory  epithelium, 
suggestive  of  a  field  effect  (26).  This  phenomenon,  coined  “field 
of  cancerization,”  was  later  shown  to  be  evident  in  various 
epithelial  cell  malignancies,  including  lung  cancer.  Some  degree 
of  inflammation  and  inflammatory-related  damage  is  almost  in¬ 
variably  present  in  the  central  and  peripheral  airways  of  smok¬ 
ers  and  may  precede  the  development  of  lung  cancer  (4).  Thus, 
the  field  of  cancerization  may  also  be  explained  by  both  the 
direct  effect  of  tobacco  carcinogens  and  the  initiation  of  inflam¬ 
matory  response.  In  this  context,  different  theories  for  the  ori¬ 
gin  of  the  field  cancerization  or  smoking-related  field  of  injury 
have  been  put  forward  and  extensively  reviewed  elsewhere  by 
Steiling  and  coworkers  (27). 

Multiple  altered  foci  of  bronchial  epithelium  are  present 
throughout  the  airway  in  patients  with  lung  cancer  and  smokers 
(18,  28,  29).  As  mentioned  before,  detailed  analysis  of  histol¬ 
ogically  normal,  premalignant,  and  malignant  epithelia  from 
patients  with  lung  SCC  indicated  that  multiple,  sequentially  oc¬ 
curring  allele-specific  chromosomal  deletions  of  LOH  commence 
in  clonally  independent  foci  early  in  the  multistage  pathogenesis 
of  SCCs  (18,  28).  Notably,  31%  percent  of  histologically  normal 
epithelium  and  42%  of  mildly  abnormal  (hyperplasia/metaplasia) 
specimens  had  clones  of  cells  with  allelic  loss  at  one  or  more 
regions  examined.  Moreover,  these  molecular  aberrations  were 
also  found  in  carcinomas  in  situ  and  SCCs  and  at  a  more  ad¬ 
vanced  level  (18).  Molecular  changes  involving  LOH  of  chromo¬ 
somal  regions  3p  (e.g.,  fragile  histidine  triad  gen dFHIT),  9p 
(e.g.,  CDKN2A/pl6 ),  genomic  instability  (increased  microsatel¬ 
lite  repeats),  and  pi 6  methylation  have  been  demonstrated  in 
histologically  normal  bronchial  epithelia  in  patients  with  SCC 
and  in  the  sequence  of  pathogenesis  of  the  disease  (3).  Moreover, 
Nelson  and  colleagues  demonstrated  that  KRAS  is  also  mutated 
in  histologically  normal  lung  tissue  adjacent  to  lung  tumors  (30). 
In  addition,  similar  epigenetic  and  gene  methylation  patterns 
between  tumors  and  adjacent  histologically  normal  epithelia 
were  described.  Belinsky  and  colleagues  reported  aberrant  pro¬ 
moter  methylation  of  pl6,  which  was  described  to  be  commonly 
methylated  in  lung  tumors  (31),  in  at  least  one  bronchial  epithe¬ 
lial  site  from  44%  of  lung  cancer  cases  examined  (32).  Moreover, 
pi 6  and  death- associated  protein  kinase  (DAPK)  promoter 
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cer  and  persisted  after  smoking  cessation  (32). 

The  aforementioned  molecular  abnormalities  were  detected 
in  histologically  normal  epithelia  adjacent  to  archival  surgically 
resected  tumors  from  patients  with  primary  lung  cancer.  LOH 
and  microsatellite  alterations  in  multiple  foci  were  also  detected 
in  distal  histological  normal  bronchial  epithelia  of  smokers  with¬ 
out  cancer  (19,  20).  Moreover,  and  importantly,  these  molecular 
abnormalities  were  detected  in  bronchial  epithelia  of  cancer- 
free  former  smokers  and  appeared  to  have  persisted  for  many 
years  after  smoking  cessation.  In  addition,  LOH  was  detected  in 
DNA  obtained  from  bronchial  brushings  of  normal  and  abnor¬ 
mal  lungs  from  patients  undergoing  diagnostic  bronchoscopy 
and  was  detected  in  cells  from  the  ipsilateral  and  contralateral 
lung  (33).  Mutations  in  TP53  were  also  described  to  occur  in 
bronchial  epithelia  of  cancer-free  smokers  in  a  widely  dispersed 
manner  (34).  Similar  evidence  also  exists  for  promoter  methyl- 
ation  and  epigenetic  changes  in  smoking-damaged  lung  epithe¬ 
lium  of  cancer-free  patients.  Methylation  of  various  genes, 
including  retinoic  acid  receptor  2  |3  ( RAR-f32 ),  H-cadherin ,  ad¬ 
enomatous  polyposis  coli  ( APC ),  pi  6,  and  Ras  association 
(RalGDS/AF-6)  domain  family  member  1  ( RASSFF1A )  has 
been  described  in  bronchial  epithelial  cells  of  heavy  smokers 

(35) .  Moreover,  methylation  of  pi 6,  glutathione  S-transferase 
pi  1  ( GSTP1 ),  and  DAPK  was  reported  to  be  evident  in  bron¬ 
chial  brushings  of  one-third  of  cancer-free  smokers  examined 

(36) .  In  the  study  by  Belinsky  and  colleagues,  as  mentioned  be¬ 
fore,  methylation  of  pi  6  was  detected  in  epithelia  of  cancer-free 
smokers  (32).  A  more  detailed  list  of  aberrant  gene  promoter 
methylation  in  patients  with  lung  cancer  and  cancer-free  smok¬ 
ers  is  well  summarized  and  explained  in  the  review  by  Heller 
and  co workers  (37). 
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study  the  transcriptome  of  lung  airways.  Hackett  and  colleagues 
studied  the  expression  of  44  antioxidant-related  genes  using 
bronchial  brushings  from  cancer-free  current  smokers  and 
never-smokers  and  found  significant  up-regulation  of  16  of  the 
antioxidant  genes  in  the  airways  of  smokers  compared  with  non- 
smokers  (38).  Later,  Spira  and  colleagues  described  global  alter¬ 
ations  in  gene  expression  between  normal-appearing  bronchial 
epithelium  of  healthy  cancer-free  smokers  and  that  of  non- 
smokers  (39).  Importantly,  irreversible  changes  in  expression 
in  airways  of  former  smokers  after  years  of  smoking  cessation 
were  described  that  were  believed  to  underlay  the  increased  risk 
former  smokers  exhibit  for  developing  lung  cancer  (39,  40). 
Alterations  in  the  expression  of  microRNAs  were  also  demon¬ 
strated  between  large  airways  of  current  and  never-smokers 
(41).  Notably,  an  80-gene  signature  was  derived  from  the  tran¬ 
scriptome  of  large  airway  epithelial  cells  that  can  distinguish 
smokers  without  overt  cancer  from  smokers  with  lung  cancer 
despite  originating  from  normal  bronchial  epithelia  (42).  More 
recently,  Gustafson  and  coworkers  derived  a  phosphoinositide- 
3-kinase  ( PI3K )  pathway  activation  signature  by  using  recombi¬ 
nant  adenoviruses  to  express  the  110a  subunit  of  PI3K  in  primary 
human  epithelial  cells  (43).  The  PI3K  pathway  activation  sig¬ 
nature  was  elevated  in  cytologically  normal  bronchial  airways  of 
smokers  with  lung  cancer  and,  importantly,  was  decreased  in  the 
airways  of  high-risk  smokers  whose  dysplastic  lesions  regressed 
after  treatment  with  the  PI3K  inhibitor  myoinositol  (43).  Micro¬ 
array  and  gene  expression  profiling  methodologies  were  also 
used  to  demonstrate  the  wide  anatomical  spread  of  the  lung 
field  cancerization.  Common  gene  expression  alterations  were 
identified  in  bronchial,  nasal,  and  buccal  epithelia  of  smokers 
(44),  and  in  a  separate  study,  the  expression  of  119  genes  was 
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Figure  1.  Molecular  mapping 
analysis  of  the  field  canceriza¬ 
tion  in  non-small  cell  lung  can¬ 
cer  (NSCLC).  The  relevance  of 
the  lung  field  cancerization  to 
the  development  of  a  partic¬ 
ular  subtype  of  NSCLC  (i.e., 
adenocarcinoma  compared  with 
squamous  cell  carcinoma  [SCC]), 
is  still  unknown,  yet  possible.  An¬ 
alyzing  local  and  distant  field  of 
cancerization  by  analysis  of  the 
transcriptome  of  airway  brush¬ 
ings  from  multiple  sites  indepen¬ 
dently  for  lung  adenocarcinoma 
(yellow  spots)  and  SCC  (red  spots) 
cases  may  shed  light  on  events 
common  or  unique  to  the  mo¬ 
lecular  pathogenesis  of  the  two 
major  subtypes  of  NSCLC.  A 
"compartmental"  approach  cou¬ 
pled  to  a  gradient  or  detailed 
molecular  mapping  method, 
which  spans  the  tumor  up  to 
the  nasal  or  buccal  epithelium, 
to  studying  the  field  of  canceriza¬ 
tion  may  unravel  biomarkers  that 
can  guide  personalized  preven¬ 
tion  strategies  suitable  for  each  different  NSCLC  subtype.  In  addition,  a  comparison  of  the  distant  field  cancerization  in  patients  with  cancer  (left)  to  the 
expression  patterns  of  the  corresponding  anatomical  location  in  disease-free  individuals  (e.g.,  high-risk  heavy  smokers;  right)  would  facilitate  the  development 
of  efficacious  markers  for  the  detection  of  NSCLC.  ADC  =  adenocarcinoma;  miRNA  =  microRNA. 
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Field  Cancerization  Compartmentalization 

In  light  of  the  prevalence  of  mutations  in  the  EGFR  oncogene  in 
adenocarcinomas  and  in  particular  those  occurring  in  never- 
smokers,  Tang  and  colleagues  investigated  the  presence  of 
EGFR  mutations  in  normal  bronchial  and  bronchiolar  epithe¬ 
lium  adjacent  to  EGFR  mutant  tumors  (22).  EGFR  mutations 
were  detected  in  histologically  normal  peripheral  epithelia  in 
44%  of  patients  with  lung  adenocarcinoma  with  mutations  but 
none  in  patients  lacking  mutations  in  the  oncogene  (22).  More¬ 
over,  the  same  study  highlighted  more  frequent  EGFR  muta¬ 
tions  in  normal  epithelium  within  the  tumor  (43%)  than  in 
adjacent  sites  (24%),  suggesting  a  localized  held  effect  phenom¬ 
enon  for  this  abnormality  in  the  respiratory  epithelium  of  the 
lung  (22).  In  addition,  a  higher  frequency  of  mutations  in  cells 
obtained  from  small  bronchi  (35%)  compared  with  bronchioles 
(18%)  was  detected  (22).  More  recently,  EGFR  protein  over¬ 
expression,  similar  to  mutation  of  the  gene,  also  exhibited  a  lo¬ 
calized  held  effect,  as  it  was  more  frequent  in  normal  bronchial 
epithelia  sites  within  tumors  than  in  sites  adjacent  to  and  distant 
from  tumors  (23).  Interestingly,  EGFR  copy  number  alteration 
was  not  evident  in  normal  bronchial  epithelia,  which  is  in  ac¬ 
cordance  with  hndings  that  EGFR  copy  number  is  a  relatively 
late  event  in  pathogenesis  of  adenocarcinomas  (5,  23).  These 
hndings  suggest  that  adenocarcinomas  may  be  associated  with 
a  held  cancerization  dissimilar  from  that  linked  to  SCCs. 

The  low  frequency  of  molecular  abnormalities  detected  in  the 
centrally  located  bronchial  respiratory  epithelium  in  patients 
with  peripheral  lung  adenocarcinomas,  compared  with  specimens 
from  patients  with  SCCs  (28),  suggests  the  presence  of  two  com¬ 
partments  in  the  lung  with  different  degrees  of  smoking-related 
genetic  damage.  Thus,  smokers  who  develop  SCCs  display  more 
smoking-related  genetic  damage  in  the  respiratory  epithelium  of 
the  central  airway,  whereas  patients  who  develop  adenocarci¬ 
nomas  exhibit  molecular  and  histological  damage  mainly  in  the 
peripheral  airways.  Although  some  molecular  changes  (e.g., 
inflammation  and  signaling  pathways  activation)  have  been 
detected  throughout  the  lung  airway  and  include  both  compart¬ 
ments  (central  and  peripheral  airway),  other  aberrations  have 
been  more  frequently  altered  in  either  central  (e.g.,  LOH,  ge¬ 
netic  instability  evidenced  by  microsatellite  repeats)  or  periph¬ 
eral  (e.g.,  EGFR  mutations  as  mentioned  above)  airways.  These 
interesting  observations  indicate  a  possible  compartmentaliza- 
tion  of  held  cancerization  and  its  dissimilarity  between  adeno¬ 
carcinomas  and  SCCs,  which  may  well  reflect  the  differential 
mechanisms  of  pathogenesis  of  both  NSCLC  subtypes. 

FUTURE  DIRECTIONS  AND  CONCLUSION 

Applying  the  same  advanced  high-throughput  methodologies 
currently  used  in  studying  established  tumors  for  the  genetic 
analysis  of  lung  NSCLC  pre-neoplasia  and  histologically  normal 
adjacent  regions  is  expected  to  expand  our  understanding  of  the 
biology  of  this  prevalent  disease.  Next-generation  sequencing 
technology,  through  whole-genome,  whole-exome,  and  whole- 
transcriptome  approaches,  holds  great  promise  for  providing  in¬ 
valuable  insights  into  NSCLC  biology,  diagnosis,  prevention,  and 
therapy  (46).  An  important  step  in  this  direction  was  a  recent 
study  in  which  RNA  of  bronchial  airway  epithelial  cell  brush- 
ings  from  healthy  never-smokers  and  smokers  with  and  without 
lung  cancer  was  analyzed  by  RNA  sequencing  (47)  and  provided 
additional  insight  besides  that  provided  when  using  microarray 
technology. 


and  perpetuate  differential  effects  on  the  airway  epithelia  (4). 
Changes  in  expression  in  the  lung  held  of  injury  have  been  shown 
to  be  similar  in  the  large  and  small  airways,  and  it  is  unknown 
whether  they  are  associated  with  the  development  of  the  partic¬ 
ular  subtype  of  NSCLC.  Addressing  this  question  may  be  highly 
important,  because  both  NSCLC  subtypes  display  different  ge¬ 
nomic  features  (2)  and,  therefore,  are  clinically  managed  by  sig¬ 
nificantly  dissimilar  treatment  strategies,  let  alone  differences 
among  various  subtypes  of  lung  adenocarcinomas.  Moreover, 
revisiting  the  held  cancerization  effect  using  a  compartmental 
coupled  with  a  gradient  or  detailed  molecular  mapping  approach 
in  patients  with  cancer  and  disease-free  individuals  (Figure  1) 
will  shed  light  on  events  in  the  early  pathogenesis  of  lung  adeno¬ 
carcinomas  and  SCCs  and  unravel  biomarkers  that  can  guide 
targeted  and  personalized  chemoprevention  strategies  suitable 
for  each  different  NSCLC  subtype  as  well  as  detection  efforts, 
in  particular  using  less  invasive  sites. 

Despite  numerous  efforts  that  have  centered  on  increasing  our 
understanding  of  the  biology  of  lung  cancer,  this  malignancy  still 
composes  the  biggest  share  of  cancer-related  deaths  in  the  United 
States  and  worldwide.  Compared  with  advances  in  targeted  and  per¬ 
sonalized  therapy  of  NSCLC,  little  progress  has  been  made  in  the 
tailored  prevention  of  this  fatal  malignancy.  This  may  change  with 
the  recent  encouraging  and  significant  hndings  of  the  National  Lung 
Screening  Trial  (11).  Various  molecular  markers  and  expression 
classifiers  previously  described  in  the  lung  airways  and  in  less- 
invasive  sites  of  held  cancerization  (e.g.,  nasal  epithelium)  can  aid 
in  selecting  high-risk  individuals  best  suited  for  computed  tomog¬ 
raphy  screening,  for  example.  A  comprehensive  analysis  of  early 
molecular  events  in  NSCLC  pathogenesis  will  undoubtedly  unravel 
biomarkers  that  can  guide  future  chemoprevention  strategies. 

Author  disclosures  are  available  with  the  text  of  this  article  at  www.atsjournals.org. 
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Introduction:  Understanding  oncogenes  and  tumor  suppressor 
genes  expression  patterns  is  essential  for  characterizing  lung  can¬ 
cer  pathogenesis.  We  have  previously  demonstrated  that  m Gprc5a! 
h GPRC5A  is  a  lung-specific  tumor  suppressor  evidenced  by  inflam¬ 
mation-mediated  tumorigenesis  in  Gprc5a- knockout  mice.  The 
implication  of  GPRC5A  in  human  lung  cancer  pathogenesis,  includ¬ 
ing  that  associated  with  inflammatory  chronic  obstructive  pulmonary 
disease  (COPD),  a  risk  factor  for  the  malignancy,  remains  elusive. 
Methods:  We  sought  to  examine  GPRC5A  immunohistochemi- 
cal  expression  in  histologically  normal  bronchial  epithelia  (NBE) 
from  lung  disease-free  never-  and  ever-smokers  (n  =  13  and  n=  18, 
respectively),  from  COPD  patients  with  ( n  =  26)  and  without  cancer 
(n  =  24)  and  in  non-small  cell  lung  cancers  (NSCLCs)  ( n  =  474). 
Quantitative  assessment  of  GPRC5 A  transcript  expression  in  airways 
(n  =  6),  adjacent  NBEs  ( n  =  29)  and  corresponding  tumors  ( n  =  6) 
from  6  NSCLC  patients  was  also  performed. 

Results:  GPRC5A  immunohistochemical  expression  was  signifi¬ 
cantly  lower  in  tumors  compared  to  uninvolved  NBE  (p  <  0.000 1 )  and 
was  positively  associated  with  adenocarcinoma  histology  ( p  <  0.001). 
GPRC5A  airway  expression  was  highest  in  lung  disease-free  NBE, 
decreased  and  intermediate  in  NBE  of  cancer-free  COPD  patients 
( p  =  0.004)  and  further  attenuated  and  lowest  in  epithelia  of  COPD 
patients  with  adenocarcinoma  and  SCC  (p  <  0.0001).  Furthermore, 
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GPRC5A  mRNA  was  significantly  decreased  in  NSCLCs  and  corre¬ 
sponding  NBE  compared  to  uninvolved  normal  lung  (p  =  0.03). 

Conclusions:  Our  findings  highlight  decreased  GPRC5A  expression 
in  the  field  cancerization  of  NSCLC,  including  that  associated  with 
lung  inflammation.  Assessment  of  the  use  of  GPRC5A  expression  as 
a  risk  factor  for  NSCLC  development  in  COPD  patients  is  warranted. 

KEY  WORDS:  Field  cancerization,  Chronic  obstructive  pulmonary 
disease,  Non-small-cell  lung  cancer,  g-protein  coupled  receptor 
family  C,  group  5,  member  A,  gene  expression. 

(J  Thorne  Oncol  2012;00:  00-00) 


Lung  cancer,  the  majority  of  which  is  non-small  cell  lung 
cancer  (NSCLC),  is  the  leading  cause  of  deaths  in  the 
United  States  and  worldwide.1  The  high  mortality  associ¬ 
ated  with  lung  cancer  is  in  part  because  of  late  diagnosis  after 
regional  or  distant  spread  of  the  disease.2  Improved  clinical 
management  of  NSCLC  is  tightly  linked  to  the  identification 
of  new  effective  early  biomarkers  that  can  spear  novel  strate¬ 
gies  for  early  detection,  prevention,  and  treatment.2’3 

The  majority  (85%)  of  diagnosed  NSCLC  cases  are 
attributable  to  cigarette  smoking.4’5  Auerbach  et  al. 6  earlier 
showed  that  tobacco  carcinogen  exposure  causes  multifocal 
and  clonal  histopathological  changes  in  the  airway  epithelia 
of  smokers,  suggestive  of  a  field  cancerization  in  the  lung.  In 
addition,  cigarette  smoking  perpetuates  inflammation  through¬ 
out  the  smoking-exposed  airway  epithelia  of  heavy  smokers,7 
which  was  suggested,  to  lead  to  onset  of  lung  cancer  develop¬ 
ment.8  Chronic  obstructive  pulmonary  disease  (COPD)  of  the 
lung  is  an  inflammatory  condition  that  is,  like  lung  cancer, 
causally  linked  to  cigarette  smoking910  and  is  a  major  cause 
of  mortality  in  the  United  States.11  Moreover,  preinvasive  lung 
cancer  lesions  are  common  (approximately  50%)  in  airways 
of  COPD  patients.12  Importantly,  although  phenotypically 
healthy  smokers  comprise  a  significant  population  at  risk  for 
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lung  cancer,5’7  COPD  increases  lung  cancer  risk  by  4.5-fold, 
more  than  either  age  or  quantity  of  smoking. 11,13,14  In  addi¬ 
tion,  33%  of  patients  with  mild-to-moderate  COPD  eventually 
die  because  of  lung  cancer  burden.11  Although  smoking  is  the 
main  cause  of  both  diseases,  only  10%  to  15%  of  smokers 
develop  COPD  and/or  lung  cancer,  suggesting  that  other  fac¬ 
tors,  including  genetic  variation  and  expression,  may  differ 
across  individuals  in  response  to  cigarette  smoke.4,5,9,10  It  is 
plausible  to  assume  that  understanding  common  expression 
patterns  between  both  diseases  for  subsequent  identification 
of  biological  markers  that  explain  risk  of  disease  onset  and/or 
progression  will  favorably  promote  the  clinical  management 
of  COPD-associated  lung  cancer.11 

Our  group  has  previously  demonstrated  that  loss  of 
G-protein  coupled  receptor,  family  C,  group  5,  member  A 
( mGprc5a/hGPRC5A ),  exemplified  by  the  Gprc5a- knockout 
mouse  model,  leads  to  chronic  inflammation  and  spontaneous 
lung  tumor  development.15,16  However,  the  expression  of  this 
tumor  suppressor  in  human  lung  NSCLCs,  including  those 
associated  with  inflammatory  conditions  such  as  COPD,  is 
unknown.  We  investigated  the  immunohistochemical  expres¬ 
sion  of  GPRC5A  in  a  large  series  of  NSCLC  histological  tis¬ 
sue  specimens  and  in  normal  bronchial  epithelia  (NBE)  from 
lung-disease-free  individuals,  COPD  patients  and  from  COPD 
patients  with  lung  cancer  and  its  transcript  expression  in  inde¬ 
pendent  resected  normal  epithelia  and  corresponding  tumors. 
Our  findings  reveal  marked  reduced  GPRC5A  expression  in 
human  NSCLC  and  in  the  histologically  normal  field  can- 
cerization  associated  with  COPD  and  lung  cancer.  Our  study 
pinpoints  to  a  potential  role  of  this  tumor  suppressor  in  the  pro¬ 
gression  of  COPD-associated  NSCLC,  which  warrants  further 
studies  to  assess  its  use  as  a  risk  marker  for  this  disease. 

MATERIALS  AND  METHODS 

NSCLC  Tissue  mMcroarrays  and  Normal 
Bronchial  Epithelial  Specimens 

Tissue  microarrays  (TMAs)  used  in  this  study  com¬ 
prised  474  surgically  resected  NSCLC  tumor  specimens  (308 
adenocarcinomas,  166  squamous  cell  carcinomas  [SCCs]) 
collected  under  an  Institutional  Review  Board  protocol  and 
archived  as  formalin-fixed  paraffin-embedded  (LLPE)  speci¬ 
mens  in  the  University  of  Texas  Specialized  Program  of 
Research  Excellence  thoracic  tissue  bank  at  the  University 
of  Texas  M.D.  Anderson  Cancer  Center.  Clinicopathological 
features  of  the  NSCLC  patients  examined  are  summarized  in 
Table  1.  TMAs  were  prepared  with  a  manual  tissue  arrayer 
(Advanced  Tissue  Arrayer  ATA100,  Chemicon  International, 
Temecula,  CA)  using  1-mm  diameter  cores  in  triplicate  for 
tumors,  as  described  previously.17  Histological  sections  4  pm 
in  thickness  were  then  prepared  for  subsequent  immunohisto- 
chemistry  analysis.  LLPE  specimens  of  NBE  from  50  patients 
with  COPD  with  forced  expiratory  volume  in  1  second/forced 
vital  capacity  ratio  of  70%  or  less,  a  smoking  history  of  30  or 
more  pack-years,  and  collected  at  Baylor  college  of  Medicine 
(Houston,  TX)  were  included  in  the  study,  including  24  can¬ 
cer-free  cases  and  26  cases  with  NSCLC.  In  addition,  NBE 
specimens  from  never  smokers  ( n  =  13)  and  ever  smokers 


TABLE  1.  Clinicopathological  Features  of  NSCLC  Patients  in 
Tissue  Microarray  Sets  Used  in  the  Study 

Covariate 

Levels 

n  (%) 

Sex 

Female 

240  (50.6) 

Male 

234  (49.4) 

Histology 

Adenocarcinoma 

308  (65.0) 

Squamous  cell  carcinoma 

166  (35.0) 

Stage 

I 

302  (63.7) 

II 

86(18.1) 

III  or  IV 

86(18.1) 

Grade* 

Well 

36  (10.4) 

Moderate 

193  (55.9) 

Poor 

116(33.6) 

Smoking  status 

Current 

178  (37.6) 

Former 

229  (48.3) 

Never 

67(14.1) 

Tobacco  history 

No 

67(14.1) 

Yes 

407  (85.9) 

*Information  on  differentiation  grade  was  available  in  a  subset  of  NSCLCs  analyzed. 
NSCLC,  non-small-cell  lung  cancer. 


(n  =  18)  without  lung  cancer  and  with  no  or  mild  airway 
obstruction  status  were  analyzed. 

Immunohistochemical  Analysis 

Polyclonal  antirabbit  antibodies  raised  against  human 
GPRC5A  were  generated  by  peptide  synthesis  and  site- 
directed  carrier  conjugation  using  keyhole  limpet  hemocyanin 
as  a  custom  service  by  Zymed  Laboratories  Inc.  (South 
San  Francisco,  CA).  The  synthetic  peptide  (cysteine)- 
PSPYKDYEVKKEGS-COOH,  corresponding  to  amino 
acids  344-357  in  the  human  GPRC5A  C-terminus,  was 
covalently  linked  to  keyhole  limpet  hemocyanin  via  an  added 
cysteine  residue,  and  polyclonal  antipeptide  antibodies  were 
prepared  in  rabbits.  Sera  were  confirmed  to  contain  high-titer 
antibodies  against  the  specific  peptide  using  enzyme-linked 
immunosorbent  assays.  TMAs  and  the  histologic  sections  (4 
pm  thick)  of  surgical  resected  specimens  were  deparaffinized 
and  hydrated,  and  antigen  retrieval  was  performed  using  a 
decloaker  with  Dako  target  retrieval  system  at  a  pH  of  6 . 0  (Dako 
North  America,  Inc.,  Carpinteria,  CA).  Intrinsic  peroxidase 
activity  was  blocked  by  3%  methanol  and  hydrogen  peroxide 
for  12  minutes  and  serum-free  protein  block  (Dako)  was  used 
for  30  minutes  for  blocking  nonspecific  antibody  binding. 
Slides  were  then  incubated  with  the  antibodies  against  human 
GPRC5A  (1 :200  dilution)  at  4°C  overnight.  After  three  washes 
in  Tris-buffered  saline,  slides  were  incubated  for  30  minutes 
with  Dako  Envision  +  Dual  Link  at  room  temperature.  After 
three  additional  washes,  slides  were  incubated  with  Dako 
chromogen  substrate  for  5  minutes  and  were  counterstained 
with  hemotoxylin  for  another  5  minutes.  FFPE  whole-section 
specimens,  except  for  the  omission  of  the  primary  antibodies, 
were  used  as  negative  controls.  The  intensity  and  extent  of 
cytoplasmic  and  nuclear  GPRC5A  immuno staining  were 
evaluated  using  a  light  microscope  (magnification,  x  20) 
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independently  by  two  pathologists  (JF  and  IIW).  GPRC5A 
immunoreactivity  was  mainly  cytoplasmic,  which  was 
quantified  using  a  four- value  intensity  score  (0,  none;  1,  weak; 
2,  moderate;  and  3,  strong)  and  the  percentage  (0-100%)  of 
the  extent  of  reactivity  A  final  cytoplasmic  expression  score 
was  obtained  by  multiplying  the  intensity  and  reactivity 
extension  values  (range,  0-300). 

CPRC5A  Transcript  Expression  in  Resected 
NSCLC  Specimens  and  Adjacent  Airway 
Epithelia 

GPRC5A  mRNA  expression  was  assessed  in  ever-smoker 
NSCLCs  and  their  uninvolved  normal  lung  parenchyma  tis¬ 
sues  as  well  as  in  NBE  collected  by  brushing  of  multiple 
corresponding  airways  from  the  same  lobectomy-  and  pneu¬ 
monectomy-resected  specimens.  Tumor  tissues  from  six  ever- 
smoker  NSCLCs  and  normal  lung  specimens  (n  =  6)  as  well  as 
NBE  by  brushings  (n  =  29)  from  the  same  patients  were  obtained 
under  an  Institutional  Review  Board  approved  protocol  in  which 
specimens  were  collected  from  patients  who  had  signed  letters 
of  consent.  NBE  were  collected  from  multiple  adjacent  to  tumor 
airways  by  brushing  each  site  independently  using  Cytosoft 
cytology  brushes  (Cardinal  Health,  Dublin,  OH).  Parallel 
brushes  were  used  for  touch-prep  for  cytological  assessment  by 
pan-cytokeratin  staining,  which  revealed  epithelial  content  more 
than  90%.  Normal  histology  was  determined  by  hematoxylin 
and  eosin  staining.  Brushes  were  immediately  placed  in  Qiazol 
lysis  reagent  (Qiagen,  Valencia,  CA)  and  stored  in  -80°C  until 
further  processing.  NSCLC  and  normal  lung  specimens  were 
shaved  for  histological  assessment  of  percentage  of  tumor  con¬ 
tent  and  malignant  cells  and  for  corresponding  RNA  isolation. 
Total  RNA  from  all  samples  was  purified  using  the  miRNeasy 
kit  (Qiagen)  according  to  the  manufacturer’s  instructions.  RNA 
quantity  was  determined  using  the  NanoDrop  spectrophotome¬ 
ter,  (Thermo  Scientific,  Wilmington,  DE),  and  quality  was  deter¬ 
mined  by  analysis  of  RNA  integrity  with  Agilent  Bioanalyzer 
2000  (Agilent  Technologies,  Santa  Clara,  CA). 

Quantitative-Real  Time  Polymerase 
Chain  Reaction 

Total  RNA  (150ng)  was  reverse-transcribed  using 
the  high-capacity  RNA-to-cDNA  kit  (Life  Technologies, 
Carlsbad,  CA)  according  to  the  manufacturer’s  instructions. 
Quantitative  real-time  polymerase  chain  reaction  (PCR)  was 
performed  using  TaqMan®  (Applied  Biosystems,  Foster  City, 
CA)  gene  expression  assays  for  GPRC5A  (Hs00173681_ml) 
and  beta-actin  ( ACTB )  (Hs99999903_ml)  primers  (Life 
Technologies)  on  a  7900HT  Fast-Real-Time  PCR  System 
(Life  Technologies)  according  to  the  manufacturer’s  instruc¬ 
tions.  All  samples  were  done  in  triplicates  and  normalized  to 
ACTB.  Relative  quantification  was  calculated  using  the  com¬ 
parative  cycle  threshold  method  as  previously  described.18 

Statistical  Analysis 

Summary  statistics,  including  frequency  tabulation, 
means,  standard  deviations,  median,  and  range,  were  given 
to  describe  patient  characteristics.  Wilcoxon  rank  sum  test 


or  Kruskal-Wallis  test  was  used  to  compare  GPRC5A 
immunohistochemical  expression  between  two  levels  or  among 
more  than  two  levels,  when  appropriate.  General  linear  model 
was  applied  to  test  the  difference  of  GPRC5  A  expression  across 
NBE  from  COPD-  and  cancer-free  never  and  ever  smokers, 
cancer-free  COPD  patients,  as  well  as  NBE  from  patients  with 
both  COPD  and  cancer.  Repeated  measures  analysis  was  used  to 
determine  significance  of  GPRC5A  transcript  variation  across 
matched  NSCLCs,  airways,  and  uninvolved  normal  lung.  All 
statistical  tests  were  two-sided,  and p  values  of  0.05  or  less  were 
considered  to  be  statistically  significant.  Statistical  analysis  was 
performed  with  standard  statistical  software,  including  SAS 
Release  9.1.3  for  Windows  and  S-Plus  8.0  for  Windows. 


RESULTS 

GPRC5A  Immunohistochemical  Expression  in 
Human  NSCLC 

We  have  previously  demonstrated  that  m Gprc5a! 
h GPRC5A  is  a  lung-specific  tumor  suppressor  evidenced 
by  spontaneous  tumor  development  in  mice  with  knockout 
of  this  gene  as  well  as  loss  of  the  transformed  phenotype  in 
malignant  NSCLC  cell  lines  after  GPRC5A  overexpression.16 
We  sought  to  examine  the  immunohistochemical  expression 
patterns  of  GPRC5A  tumor  suppressor  in  NSCLC,  which  is 
unknown,  and  its  association  with  clinicopathological  fea¬ 
tures  of  the  disease,  including  the  two  major  histologic  types 
examined,  adenocarcinoma  and  SCC.  The  clinicopatho¬ 
logical  characteristics  of  the  NSCLC  patients  from  which 
FFPE  tumor  specimens  were  analyzed  are  detailed  in  Table 
1.  Representative  photomicrographs  of  GPRC5A  immuno¬ 
histochemical  expression  in  human  lung  adenocarcinoma 
(left)  and  SCC  (right)  are  depicted  in  Figure  1  A.  GPRC5A 
immunoreactivity  was  mainly  cytoplasmic.  GPRC5A 
expression  was  significantly  lower  in  the  adenocarcinomas 
(n  =  308;  mean,  48.91  ±36;  median,  41.67;  minimum,  0; 
maximum,  180)  and  SCCs  ( n  =  166;  mean,  23.47±22.79; 
median,  18.33;  minimum,  0;  maximum,  110)  examined  com¬ 
pared  with  the  uninvolved  normal  bronchial  epithelia  (mean, 
164.52  ±34.04;  median,  160;  minimum,  100;  maximum, 
230)  from  lung-disease-free  smokers  and  never  smokers 
{p  <  0.0001)  (Fig.  IB).  Moreover,  GPRC5A  immunohisto¬ 
chemical  expression  was  significantly  lower  in  SCCs  com¬ 
pared  with  that  in  adenocarcinomas  (p  <  0.0001)  (Fig.  IB). 
We  then  correlated  GPRC5A  with  other  clinicopathological 
features  for  all  NSCLCs.  Notably,  GPRC5A  expression  was 
significantly  increased  in  NSCLC  tumor  specimens  from 
never  smokers  (n  =  67)  (mean,  53.94 ±39.94;  median,  40; 
minimum,  0;  maximum,  163.33)  compared  with  ever  smokers 
(n  =  407)  (mean,  37.71  ±32. 65;  median,  30;  minimum,  0;  max¬ 
imum,  180)  (p  =  0.001)  (Fig.  IQ.  GPRC5A  expression  was 
also  significantly  different  among  never,  former  and  current 
smoker  NSCLCs  (p  =  0.003)  (Fig.  ID).  In  addition,  GPRC5A 
expression  was  significantly  positively  associated  with  well- 
differentiated  tumor  grade  in  a  subset  of  NSCLC  tumors 
(n  =  346)  examined  with  available  grade  information  ( p  =  0.004) 
(Fig.  IE).  It  is  important  to  note  that  when  we  examined  each 
histology  type  separately,  there  were  no  statistically  significant 
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FIGURE  1.  GPRC5A  immunohistochemical  expression  in  NSCLC  histological  tissue  specimens.  A,  Representative  photomi¬ 
crographs  of  GPRC5A  immunohistochemical  expression  in  lung  adenocarcinoma  (left)  and  SCC  (right)  observed  under  a  light 
microscope  at  a  magnification  of  20  x.  Bars,  100  pm.  B,  Box-plots  depicting  differences  in  GPRC5A  immunohistochemical 
expression  among  NBE,  Adeno,  and  SCCs.  C,  Box-plots  depicting  GPRC5A  expression  based  on  tobacco  history  (never  ver¬ 
sus  ever  smokers),  (D),  smoking  status  (never,  former  and  current),  and  (£),  differentiation  grade  (well,  moderate,  and  poor). 
p  values  based  on  the  Wilcoxon  rank  sum  test  and  Kruskal-Wallis  test.  GPRC5A,  G-protein  coupled  receptor  family  C,  group 
5,  member  A;  NSCLC,  non-small-cell  lung  cancer;  SCC,  squamous  cell  carcinomas;  NBE,  normal  bronchial  epithelia;  Adeno, 
adenocarcinomas. 


correlations  between  GPRC5  A  expression  and  patient’s  smok¬ 
ing  history  and  status,  and  tumor-differentiation  grade. 

GPRC5A  Immunohistochemical  Expression 
in  NBE  from  COPD  Patients 

We  have  previously  shown  that  loss  of  m Gprc5a/ 
h GPRC5A  tumor  suppressor  leads  to  chronic-inflammation- 
mediated  prosurvival  signaling  and  transformation  of 
nonmalignant  lung  epithelial  cells.15  The  relevance  of 
GPRC5A  expression  to  human  NSCLC  pathogenesis, 
including  that  associated  with  COPD,  a  chronic  inflammatory 
condition  and  risk  factor  for  lung  cancer,  remains  elusive.  In 
light  of  the  strong  association  between  inflammation,  COPD 
and  NSCLC,10’11  we  sought  to  examine  the  expression  of 
GPRC5A  NBE  in  COPD-associated  adenocarcinoma  and 
SCC.  GPRC5A  expression  was  assessed  in  histological  tissue 
specimens  of  NSCLC  and  NBE,  including  those  from  lung 
cancer- free  COPD  and  from  NSCLC  patients.  Representative 
photomicrographs  of  GPRC5A  immunohistochemical 
staining  in  the  different  NBE  are  depicted  in  Figure  2 A. 
GPRC5A  immunore activity  seemed  to  be  highest  in  NBE 
from  lung-disease-free  never  and  ever  smokers,  successively 
significantly  decreased  in  NBE  of  COPD  patients  who  are 
cancer-free,  and  lowest  in  NBE  of  patients  with  both  COPD 
and  adenocarcinoma  or  SCC  (Fig.  2 B).  A  general  linear  model 


demonstrated  a  significant  gradual  decrease  of  GPRC5A 
expression  from  NBE  of  disease-free  never  smokers  to 
NBE  from  patients  with  COPD  and  adenocarcinoma  or 
SCC  (p  <  0.0001)  (Fig.  2 B).  The  mean  (177.31  ±30.66)  and 
median  (195;  minimum,  130;  maximum,  220)  of  GPRC5A 
expression  in  NBE  of  lung-disease-free  never  smokers 
were  higher  than  that  of  smokers  (mean,  155. 2 8± 34. 15, 
median,  147.5;  minimum,  100;  maximum,  230)  although 
the  differences  were  not  statistically  significant  (Fig.  2 B). 
Moreover,  GPRC5A  expression  was  significantly  higher 
in  NBE  of  lung-disease-free  smokers  relative  to  NBE  in 
cancer-free  COPD  patients  (mean,  85.48 ±53.55,  median, 
72.88;  minimum,  20;  maximum,  211)  (p  =  0.007),  which  in 
turn  was  significantly  higher  compared  with  the  expression 
in  NBE  from  COPD  patients  with  adenocarcinoma  (mean, 
48. 15  ±52.84,  median,  35;  minimum,  0;  maximum,  170) 
( p  <  0.0001)  or  SCC  (mean,  32.24 ±34.9,  median,  12.5; 
minimum,  0;  maximum,  101.69)  (p  <  0.0001)  (Fig.  2 B). 
Furthermore,  although  GPRC5A  expression  was  lower  in 
NBE  of  COPD  patients  with  SCC  compared  with  those 
with  COPD  and  adenocarcinoma,  the  difference  was  not 
statistically  significant  (Fig.  2 B).  These  findings  suggest  that 
reduced  expression  of  the  GPRC5A  tumor  suppressor  may 
be  implicated  in  the  pathogenesis  of  NSCLC  associated  with 
inflammatory  COPD. 
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FIGURE  2.  Reduced  GPRC5A  expression  in  the  sequential  pathogenesis  of  COPD-associated  NSCLC.  A,  Representative 
photomicrographs  of  GPRC5A  immunohistochemical  expression  in  NBE  from  disease-free  never  and  ever  smokers  (upper 
left  and  upper  right,  respectively)  and  in  NBE  from  cancer-free  COPD  patients  (bottom  left)  and  NSCLC  patients  (bot¬ 
tom  right)  with  COPD  observed  under  a  light  microscope  at  a  magnification  of  x  20.  Bars,  1  00  pm.  B,  Box-plots  depicting 
quantification  of  GPRC5A  immunohistochemical  expression  represented  in  (A).  Main  p  value  signifies  statistical  significance 
of  reduced  GPRC5A  expression  in  the  sequential  pathogenesis  of  COPD-associated  NSCLC  and  among  the  five  groups  by 
the  general  linear  model  and  Kruskal-Wallis  test,  p  values  signifying  the  indicated  pair  wise  comparisons  among  the  groups 
were  obtained  by  the  Wilcoxon  rank  sum  test.  GPRC5A,  G-protein  coupled  receptor  family  C,  group  5,  member  A;  NBE, 
normal  bronchial  epithelia;  NSCLC,  non-small-cell  lung  cancer;  COPD,  chronic  obstructive  pulmonary  disease;  SCC,  squa¬ 
mous  cell  carcinomas;  NS,  not  significant. 


We  then  examined  GPRC5A  expression  in  epithelial 
cells  from  different  compartments  of  the  lung  (bronchial, 
bronchiolar  and  alveolar  epithelia  and  cells).  GPRC5A 
expression  was  statistically  significantly  lower  in  bronchi 
and  bronchioles  from  cancer-free  COPD  patients  (p  <  0.005) 
and  patients  with  both  COPD  and  SCCs  or  adenocarcinomas 
( p  <  0.0001)  compared  with  lung-disease-free  never  smokers 
(Supplementary  Table  1,  Supplemental  Digital  Content  1, 
http://links.lww.com/JTO/A334).  In  contrast,  GPRC5A 
expression  in  alveolar  compartment  remained  high  (score 
average  172.2-261.5)  in  all  groups  of  cases  examined,  and 
there  were  no  statistically  significant  differences  based  on 


COPD  status  (Supplementary  Table  1,  Supplemental  Digital 
Content  1,  http://links.lww.com/JTO/A334). 

CPRC5A  Expression  in  Smoker  NSCLC  Field 
Cancerization 

Our  previous  findings  on  GPRC5A  tumor-suppressive 
function15’16’19  as  well  as  our  current  observation  on  significant 
and  progressive  reduced  expression  of  this  tumor  suppressor  in 
NBE  from  lung  cancer-free  COPD  patients  and  patients  with 
NSCLC,  prompted  us  to  probe  whether  GPRC5A  expression 
exhibits  a  field  cancerization  pattern  in  NSCLC.  We  sought 
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to  assess  GPRC5A  transcript  expression  in  multiple  bronchial 
sites  ( n  =  29)  as  well  as  in  the  tumors  ( n  =  6)  and  uninvolved 
normal  lung  parenchyma  ( n  =  6)  in  six  NSCLC  cases  (3  SCCs 
and  3  adenocarcinomas).  Quantitative  real-time  PCR  analysis 
demonstrated  progressively  reduced  GPRC5A  expression  from 
normal  uninvolved  lung  parenchyma  (relative  quantification 
[RQ]  mean,  4. 18 ±0.62;  median,  4.16;  minimum,  3.51;  maxi¬ 
mum,  5.04)  to  histologically  normal  and  adjacent  bronchial  epi- 
thelia  (RQ  mean,  3.12±1.94;  median,  2.86,;  minimum,  0.59; 
maximum,  10.12)  to  NSCLC  tumors  (RQ  mean,  1.35 ±0.71; 
median  1.31;  minimum  0.24;  maximum  2.30),  which  exhibited 
on  an  average  the  lowest  expression  of  the  gene  (Fig.  3 A  and  B ) 
( p  =  0.03).  These  findings  suggest  that  GPRC5A  expression  is 
reduced  in  the  smoking-injured  field  cancerization  in  NSCLC 
highlighting  a  potentially  strong  implication  of  this  tumor  sup¬ 
pressor  in  the  pathogenesis  of  this  malignancy. 

DISCUSSION 

In  this  study,  we  assessed  the  expression  of  GPRC5A, 
which  we  had  found  to  exhibit  lung-specific  tumor- suppres¬ 
sor  properties  in  mice,15,16’19  in  human  NSCLC  histological 
tissue  specimens,  as  well  as  in  NBE  from  COPD  and  lung 
adenocarcinoma  and  SCC  patients.  We  found  that  the  protein 
product  of  human  GPRC5A  was  lower  in  lung  tumors  com¬ 
pared  with  uninvolved  histologically  NBE,  and  was  signifi¬ 
cantly  associated  with  adenocarcinoma  histology.  GPRC5A 
airway  expression  seemed  to  exhibit  a  progressive  decrease  in 
the  field  cancerization  of  COPD-related  NSCLC  with  highest 
expression  in  bronchial  epithelia  obtained  from  disease-free 
individuals,  intermediate  immunoreactivity  in  normal  epithe¬ 
lia  from  cancer- free  COPD  patients,  and  lowest  in  normal  epi¬ 
thelia  from  patients  with  both  COPD  and  adenocarcinoma  and 
SCC  histologies.  Furthermore,  GPRC5A  transcript  expression 
was  also  lower  in  histologically  NBE  as  well  as  in  adjacent 
and  corresponding  smoker  NSCLC  tumors,  irrespective  of 


COPD  status,  compared  with  uninvolved  normal  lung  sugges¬ 
tive  of  a  field  cancerization-mediated  expression  modulation 
of  this  G-protein  coupled  receptor.  In  light  of  the  relevance 
of  the  field  cancerization  phenomenon  to  NSCLC  pathogen¬ 
esis, 5_7>20>21  our  findings  pinpoint  to  a  potential  tumor- suppres¬ 
sive  role,  similar  to  that  established  in  mice,  of  GPRC5A  in 
the  sequential  development  of  human  NSCLC,  in  particular 
those  associated  with  inflammatory  chronic  obstructive  dis¬ 
ease.  NSCLC  and  COPD  are  both  mainly  attributable  to  ciga¬ 
rette  smoke910,13  and  are  leading  causes  of  deaths  in  the  United 
States  and  worldwide.1’2’11  Given  that  COPD  is  a  major  risk 
factor  for  lung  cancer  and  shares  various  pathogenic  features 
with  lung  tumors,9  10  22-25  better  molecular  markers  are  needed 
to  identify  which  COPD  patients  will  continue  to  develop  lung 
malignancies.11  Thus,  the  results  herein  raise  the  intriguing 
possibility  that  GPRC5A  loss  may  be  a  useful  biomarker  in 
assessing  the  risk  of  NSCLC  development  in  COPD  patients. 

COPD  and  infections  as  well  as  inflammatory  disorders 
of  the  respiratory  tract  may  be  linked  to  an  increased 
risk  of  lung  cancer.26’27  We  have  previously  demonstrated 
significantly  increased  macrophage  infiltration  into  lungs  of 
Gprc5a- knockout  mice  along  with  their  direct  association 
with  adenocarcinomas,  which  was  accompanied  by  higher 
constitutive  levels  of  proinflammatory  cytokines  and 
chemokines  and  increased  susceptibility  to  stimulation  of 
nuclear  factor-kappa  B  (NF-kB)  activation  in  vivo.15  We  also 
showed  that  loss  of  Gprc5a-mediated  activation  of  NF-kB 
was  causally  linked  to  macrophage  recruitment  and  enhanced 
inflammation  toward  the  creation  of  a  tumor-promoting 
microenvironment.15  It  is  noteworthy  that  lungs  of  patients 
with  severe  COPD  exhibit  macrophage/CD 8+  T  cell  and 
neutrophil  infiltration  based  on  histopathological  studies  and 
bronchial-biopsies.28  Moreover,  it  is  worthwhile  to  mention 
that  we  attempted  to  statistically  assess  the  association 
between  expression  of  GPRC5A  and  that  of  NF-kB  (nuclear 
p65  immunoreactivity)  in  NSCLCs  and  found  a  significant 


FIGURE  3.  Decreased  GPRC5A  mRNA  expression 
in  histologically  NBE  and  corresponding  NSCLC 
tumors  from  resected  specimens  compared  with 
matched  uninvolved  normal  lung.  Total  RNA  was 
isolated  from  brushings  of  NBE  and  frozen  sec¬ 
tions  of  matched  NSCLC  (T)  and  uninvolved  NL 
parenchyma  obtained  from  resected  lobectomy  or 
pneumonectomy  specimens  performed  on  smoker 
early-stage  patients.  A,  GPRG5A  expression  was 
analyzed  in  all  samples  (in  triplicates)  by  QRTPCR 
and  normalized  against  that  of  AGTB  to  yield  a 
relative  expression  by  the  2-AACT  method  depicted 
in  the  graphs.  NSCLC  tumors  (squamous  cell 
carcinoma,  cases  1-3,  and  adenocarcinomas,  cases 
4-6),  uninvolved  NL  parenchyma  and  bronchial 
structures  are  labeled  by  the  indicated  colors.  B, 
GPRG5A  transcript  expression  among  uninvolved 
NL  tissue,  NBE,  and  NSCLC  tumors  was  assessed 
statistically  by  repeated  measures  analysis.  NBE, 
normal  bronchial  epithelia;  NSCLC,  non-small-cell 
lung  cancer;  NL,  normal  lung;  CT,  comparative 
cycle  threshold  method.;  QRTPCR,  quantitative 
real-time  polymerase  chain  reaction. 
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inverse  correlation  between  both  proteins  in  particular 
in  adenocarcinomas  (data  not  shown).  In  addition,  it  was 
recently  shown  that  the  incidence  of  hyperplastic  lesions  in 
lungs  of  Gprc5a-knockout  mice  was  increased  after  exposure 
to  nontypeable  Haemophilus  influenzae.29  Notably,  bacterial 
colonization,  particularly  with  nontypeable  Haemophilus 
influenzae,  has  been  implicated  as  a  cause  of  airway 
inflammation  in  COPD  besides  cigarette  smoke.30  In  this 
context,  the  commonalities  between  inflammatory-mediated 
and  histopathological  mechanisms  in  Gprc5a- knockout  mice 
and  those  evident  in  lungs  of  COPD  patients  along  with  our 
current  findings  on  reduced  GPRC5A  expression  in  normal 
cells  from  COPD  patients  pinpoint  to  a  tumor-suppressive  role 
of  this  gene  in  COPD-associated  human  lung  tumorigenesis. 

We  previously  cloned  the  retinoid-regulated  m Gprc5a! 
h GPRC5A  and  found  that  it  was  preferentially  expressed  in  fetal 
and  adult  mouse  and  human  lung  tissue  compared  with  nor¬ 
mal  specimens  from  other  anatomically  distinct  organs.31  The 
functional  relevance  of  this  lung-specific  expression  was  high¬ 
lighted  by  our  earlier  study  demonstrating  spontaneous  devel¬ 
opment  of  adenomas  and  adenocarcinomas  in  Gprc5a- nockout 
mice,  which  was  not  observed  in  wild-type  littermates.  In  addi¬ 
tion,  spontaneous  tumor  development  in  Gprc5a- knockout 
mice  was  characterized  by  late  tumor  onset  (12- 16-month-old 
mice)  and  low  multiplicity.16  Notably,  we  reported  that  expo¬ 
sure  to  the  tobacco-specific  carcinogen  4-(methylnitrosamino)- 
l-(3-pyridyl)-l-butanone  (nicotine-derived  nitrosamine  ketone) 
augmented  tumorigenesis  evidenced  by  6-month  earlier  devel¬ 
opment  of  lesions,  two-  to  fourfold  increased  tumor  incidence 
and  multiplicity,  as  well  as  a  dramatic  increase  in  lesion  size.19 
It  is  worthy  to  note  that,  in  this  study,  we  found  significantly 
reduced  expression  of  GPRC5A  in  human  smoking-injured 
histologically  NBE  adjacent  to  lung  tumors  and  in  the  local¬ 
ized  field  cancerization  of  smoker  NSCLC  patients,  compared 
with  uninvolved  normal  lung.  Moreover,  reduced  GPRC5A 
expression  was  concomitant  among  airways  and  correspond¬ 
ing  NSCLC  compared  with  matched  normal  lung  tissue.  It  is 
plausible  to  assume  that  smoking-mediated  reduced  airway 
expression  of  GPRC5A  in  patients  may  explain  our  previous 
observations  on  augmented  tumorigenesis  in  Gprc5a- knockout 
mice  after  exposure  to  tobacco  carcinogens.  Because  COPD  is 
causally  linked  to  smoking,9-11’22’23  it  cannot  be  neglected  that 
reduced  GPRC5A  expression  in  NBE  from  COPD  patients 
may  be  because  of  the  molecular  impact  of  smoking  on  the 
airway  transcriptome.  However,  it  is  important  to  mention 
that  we  found  significantly  decreased  GPRC5A  expression  in 
normal  epithelia  from  COPD  patients  compared  with  epithe- 
lia  obtained  from  disease-free  smokers.  Moreover,  GPRC5A 
expression  was  further  decreased  and  lowest  in  normal  airway 
epithelia  from  COPD  patients  with  NSCLC  compared  with 
those  from  cancer-free  COPD  patients.  Thus  in  this  context, 
our  findings  raise  the  possibility  that  GPRC5A  expression  may 
be  impacted  by  cigarette  smoking  and  may  be  both  a  cause  and 
consequence  of  increased  inflammation  in  tumor-promoting 
microenvironment  in  the  lung  as  well  as  NSCLC  development, 
which  may  be  better  delineated  in  future  warranted  studies. 

Our  group  has  previously  shown  that  human  lung  and 
head  and  neck  cancer  cell  lines  exhibit  reduced  GPRC5A 
expression  compared  with  their  normal  counterparts.31  It  is 


worthwhile  to  mention  that  treatment  of  cancer  cells  with 
all-trans  retinoic  acid  restored  GPRC5A  expression  in  cells 
devoid  of  the  tumor  suppressor  in  part  by  a  retinoic  acid 
receptor-dependent  manner.31’32  In  addition  and  in  a  sepa¬ 
rate  study,  we  have  previously  noted  reduced  retinoic  acid 
receptor  beta  (RAR|3)  in  approximately  50%  of  adenocar¬ 
cinomas  and  SCCs  as  well  as  loss  of  RARy  and  retinoid  X 
receptor (3  in  a  significant  fraction  of  the  tumors.33  Moreover, 
RAR|3  expression  is  decreased  in  bronchial  epithelia  and 
preneoplastic  lesions  representing  the  sequence  of  lung  SCC 
pathogenesis.34  More  recently,  we  demonstrated,  using  gene 
expression  profiling,  that  loss  of  murine  Gprc5a  in  lung  epi¬ 
thelial  cells  reduced  expression  of  markers  of  squamous  dif¬ 
ferentiation  concomitantly  with  an  increase  in  mediators  of 
the  inflammatory  process  (e.g.,  NF-kB  signaling).  In  light 
of  the  biological  connection  between  retinoid  signaling  and 
GPRC5A  expression  as  well  as  our  current  findings  in  this 
revised  article  on  the  significant  reduced  expression  of  the 
tumor  suppressor  in  NBE  of  COPD  patients  including  those 
with  NSCLC,  it  is  plausible  to  suggest  that  retinoid  signal¬ 
ing  may  be  aberrantly  regulated  in  pathogenesis  of  COPD- 
associated  NSCLC.  It  is  important  to  note  that  retinoids  are 
currently  being  tested  for  their  lung  regenerative  properties 
and  effects  in  patients  with  COPD.35 

In  conclusion,  we  report  herein  decreased  expression  of 
GPRC5A  in  human  NSCLC  tissue  compared  with  uninvolved 
NBE  and  in  the  field  cancerization  of  smoker  NSCLCs  pin¬ 
pointing  to  a  tumor-suppressor  role  of  this  G-protein  coupled 
receptor  in  the  pathogenesis  of  this  leading  cause  of  cancer- 
related  deaths.  Moreover,  our  findings  highlight  a  progressive 
decrease  in  GPRC5A  expression  in  the  sequential  pathogen¬ 
esis  of  NSCLCs  arising  in  COPD  patients,  warranting  future 
studies,  including  the  analysis  of  lung  cancer  preneoplastic 
lesions,  to  assess  the  potential  of  the  utility  of  this  gene  as  a 
biomarker  for  lung  cancer  risk  in  COPD  patients. 
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Abstract 


Gene  expression  alterations  in  response  to  cigarette  smoke  have  been  characterized  in  normal¬ 
appearing  bronchial  epithelium  of  healthy  smokers  and  it  has  been  suggested  that  adjacent 
histologically  normal  tissue  display  tumor-associated  molecular  abnormalities.  We  sought  to 
delineate  the  spatial  and  temporal  molecular  lung  field  of  injury  in  smoker  early  stage  non-small 
cell  lung  cancer  (NSCLC)  patients  (n=19)  who  were  accrued  into  a  surveillance  clinical  trial  for 
annual  follow-up  and  bronchoscopies  within  one  year  after  definitive  surgery.  Bronchial 
brushings  and  biopsies  were  obtained  from  six  different  sites  in  the  lung  at  the  time  of  inclusion 
in  the  study  and  at  12,  24  and  36  months  after  the  first  time  point.  Affymetrix  Human  Gene  1.0 
ST  arrays  were  used  for  whole-transcript  expression  profiling  of  airways  (n=391).  Microarray 
analysis  identified  gene  features  (n=  1 1 65)  that  were  non-uniform  by  site  and  differentially 
expressed  between  airways  adjacent  to  tumors  relative  to  more  distant  samples  as  well  as  those 
(n=1395)  that  were  significantly  altered  with  time  up  to  three  years.  In  addition,  gene-interaction 
networks  mediated  by  PI3K  and  ERK1/2  were  modulated  in  adjacent  compared  to  contralateral 
airways  and  the  latter  network  with  time.  Furthermore,  phosphorylated  AKT  and  ERK1/2 
immunohistochemical  expression  were  significantly  increased  with  time  (nuclear  pAKT,  p=0.03; 
cytoplasmic  pAKT,  p<0.0001;  pERKl/2,  p=0.02)  and  elevated  in  adjacent  compared  to  more 
distant  airways  (nuclear  pAKT,  p=0.04;  pERKl/2,  p=0.03).  This  study  highlights  spatial  and 
temporal  cancer-associated  expression  alterations  in  the  molecular  field  of  injury  of  early  stage 
NSCLC  patients  after  definitive  surgery  that  warrant  further  validation  in  independent  studies. 
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Introduction 


Lung  cancer,  of  which  non-small  cell  lung  cancer  (NSCLC)  comprises  the  majority,  is 
the  leading  cause  of  cancer-related  deaths  in  the  US  and  worldwide  (1,  2).  The  high  mortality  of 
this  disease  is  in  part  due  to  the  late  diagnosis  of  the  majority  of  lung  cancers  after  regional  or 
distant  spread  of  the  malignancy  (3).  Recent  data  from  the  National  Lung  Screening  Trial  (4), 
indicating  that  screening  increases  early  detection  rates,  are  expected  to  augment  the  number  of 
early  stage  NSCLC  detected  warranting  the  need  for  better  clinical  management  of  this  growing 
subpopulation.  Besides  adjuvant  therapy,  there  are  no  effective  chemoprevention  strategies  for 
early  stage  NSCLC  patients  (5)  who  comprise  approximately  50%  of  all  diagnosed  cases  and 
have  a  relatively  high  rate  of  relapse  (3).  Improved  clinical  management  of  early  stage  NSCLC  is 
tightly  linked  to  the  identification  of  new  effective  early  biomarkers  that  can  guide  potential 
chemoprevention  strategies. 

Most  diagnosed  NSCLC  cases  (85%)  are  attributable  to  cigarette  smoking  (6-8). 
Auerbach  et  al  found  that  cigarette  smoke  induces  extensive  histological  changes  in  the 
bronchial  epithelia  in  the  lungs  of  smokers  and  that  premalignant  lesions  are  widespread  and 
multifocal  throughout  the  respiratory  epithelium,  suggestive  of  a  field  effect  (9).  Many  molecular 
abnormalities,  such  as  loss  of  heterozygosity  (LOH)  (10-12),  mutations  in  TP 5 3  (13), 
methylation  of  pi  6  tumor  suppressor,  death  associated  protein  kinase  ( DAPK)  and  retinoic  acid 
receptor  2  beta  (. RAR-/32 )  were  detected  in  bronchial  epithelia  of  cancer-free  former  smokers  (14- 
18)  some  of  which  persisting  for  many  years  after  smoking  cessation  (15).  More  recently,  global 
mRNA  and  microRNA  (miRNA)  expression  profiles  have  been  described  in  the  normal¬ 
appearing  bronchial  epithelium  of  healthy  smokers  (19,  20)  that  are  different  from  those  in  non- 
smokers.  Moreover,  expression  and  pathway  signatures  have  been  derived  from  normal 
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bronchial  epithelium  of  smokers  that  exhibited  diagnostic  properties  (21,  22).  Molecular  changes 
involving  LOH  of  chromosomal  regions  3p  (DDUT  and  FHIT  genes),  9p  ( CDKN2A ),  genomic 
instability  (increased  microsatellite  repeats)  and  pi 6  methylation  have  been  demonstrated  in 
histologically  normal  epithelium  in  squamous  cell  carcinoma  patients  and  in  the  sequence  of 
pathogenesis  of  the  disease  (14,  23,  24).  Moreover,  Nelson  et  al  demonstrated  that  KRAS  is  also 
mutated  in  histologically  normal  lung  tissue  adjacent  to  lung  tumors  (25).  Furthermore,  Tang  and 
colleagues  found  higher  rates  of  EGFR  mutations  in  adjacent  normal  bronchial  epithelia  (26,  27) 
suggestive  of  a  potential  localized  field  effect. 

It  is  plausible  to  assume  that  understanding  early  molecular  aberrations  in  histologically 
normal  smoke-damaged  airway  epithelium  of  early  stage  patients  would  serve  as  a  critical  first 
step  towards  identification  of  biomarkers  that  can  guide  lung  cancer  prevention  strategies. 
However,  the  global  molecular  airway  field  of  injury  in  early  stage  NSCLC  patients,  in  particular 
after  definitive  surgery,  is  unknown.  In  this  study,  we  used  transcript-level  expression  profiling 
coupled  with  gene-interaction  network  analysis  and  immunohistochemical  analysis  to 
characterize,  in-depth,  site-  and  time-dependent  global  molecular  alterations  in  airways  of 
smoker  early  stage  NSCLC  patients. 

Materials  and  Methods 

Patient  population  and  airway  epithelial  cell  collection.  Early  stage  (I/II),  current  or  former 
smoker  NSCLC  patients  with  at  least  a  10-pack-year  smoking  history  and  without  evidence  of 
disease  after  definitive  surgery  were  recruited  into  the  Vanguard  phase  II  surveillance  clinical 
trial  (clinical  trial  number  NCT00352391)  within  one  year  from  time  of  surgery.  Patients  were 
accrued  between  2004-2008  and  underwent  frequent  testing  including  chest  x-rays,  CT  scans, 
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laboratory  work,  serologies,  flexible  bronchoscopies  and  airway  biopsy  collections  within  one 
year  from  surgery  (average,  6  months;  range,  1-12)  (first  time  point),  and  at  months  12,  24,  and 
36  following  the  first  time  point.  Bronchoscopies  were  performed  using  white  light  or  both  white 
light  and  autofluorescence  modalities.  Biopsies  were  obtained  from  all  potential  anatomical 
locations  and  time  points  per  patient.  In  total  there  were  324  evaluable  airway  biopsies. 
Histological  assessment  was  performed  to  determine  whether  malignant  changes  will  occur 
during  the  time  period.  Once  patients  have  completed  3  years  of  testing,  they  were  followed  until 
the  study  is  completed.  Patients  were  comprised  of  former  (n=16)  and  current  (n=3)  smokers. 
One  of  the  three  current  smoker  patients  quit  smoking  6  months  before  the  24  month  time  point. 
The  clinicopathological  variables  of  patients  in  this  study  are  summarized  in  Table  1.  The  study 
was  approved  by  the  Institutional  Review  Boards  and  all  participants  provided  written  informed 
consents. 

Bronchial  airway  epithelial  cells  were  obtained  from  five  to  six  different  sites  (main 
carina,  4  airways  from  4  lobes  and  the  bronchial  stump  or  main  stem  bronchus  adjacent  to  the 
originally  resected  tumor  and  lobe,  Figure  1)  at  each  time  point  using  an  Olympus  fiberoptic 
bronchoscope  (Olympus  America  Inc.,  Center  Valley,  PA)  and  cytobrushes  (Cellebrity 
Endoscopic  Cytobrush,  Boston  Scientific,  Boston,  MA).  Patients  (n=19)  with  samples/specimens 
available  for  analysis  that  were  obtained  serially  up  to  either  24  or  36  months  and  from  at  least 
from  four  different  sites  in  the  lung  at  each  time  point  (n=391  airway  samples)  were  selected  for 
the  study.  Epithelial  cell  content  was  confirmed  by  cytokeratin  staining  which  yielded  a  90% 
epithelial  cell  content  mainly  comprised  of  ciliated  cells.  Brushes  were  immediately  placed  in 
serum-free  RPMI  medium  on  ice,  vortexed  gently  to  disperse  epithelia  into  the  media  and  then 
removed.  Samples  were  then  immediately  centrifuged  and  cell  pellets  were  resuspended  in  1  ml 
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of  phosphate-buffered  saline  (PBS).  500  pi  of  the  sample  was  then  again  centrifuged  and  the 
pellet  resuspended  in  500  pi  (3-mercaptoethanol-containing  RLT  buffer  (Qiagen,  Valencia,  CA), 
homogenized  and  stored  in  -80°C  until  further  processing.  Total  RNA  was  isolated  using  the 
RNeasy  Mini  Kit  according  to  the  manufacturer’s  instructions  (Qiagen). 

RNA  processing  for  microarrays.  Total  RNA  samples  were  preprocessed  for  subsequent 
hybridization  to  expression  arrays  using  the  WT-Ovation  and  Encore™  Biotin  Module  from 
NuGEN  Technologies  Inc.  (San  Carlos,  CA)  according  to  the  manufacturer’s  instructions. 
Briefly,  the  WT-Ovation  Pico  RNA  amplification  system  was  used  to  generate  amplified  cDNA 
using  5  ng  of  starting  RNA  material.  After  formation  of  double  stranded  cDNA,  DNA  was 
amplified  using  the  SPIA™  amplification  method,  a  linear  isothermal  DNA  amplification 
process  developed  by  the  vendor  (NuGEN  Technologies).  The  WT-Ovation™  Exon  Module 
(NuGEN  Technologies)  was  then  used  for  generation  of  amplified  sense  strand  cDNA  (ST- 
cDNA)  that  is  suitable  for  subsequent  array  analysis  with  the  Affymetrix  Human  Gene  1.0  ST 
array  platform  (Affymetrix,  Santa  Clara,  CA).  Fragmented  and  biotin-labeled  cDNA  was  then 
generated  using  the  Encore™  Biotin  Module  (NuGEN  Technologies)  using  5  pg  of  amplified 
cDNA.  Quality  and  size  distribution  of  unfragmented  SPIA-amplified  cDNA  and  subsequent 
fragmented  labeled  cDNA  were  assessed  by  loading  samples  on  an  RNA  6000  Nano  LabChip 
(Agilent)  and  analysis  with  Agilent  bioanalyzer  2000  (Agilent).  No  differences  in  quality  were 
noted  based  on  duration  of  sample  storage. 

Generation  of  microarray  raw  data  and  analysis.  Fragmented  and  labeled  cDNA  (2.5  pg) 
were  hybridized  onto  Human  Gene  1.0  ST  arrays  according  to  the  manufacturer’s  instructions 
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(Affymetrix).  Hybridization  cocktails  containing  samples,  control  oligonucleotide  and  eukaryotic 
hybridization  controls  in  addition  to  hybridization  mixes,  DMSO  and  nuclease-free  water  were 
heat  denatured  at  99°C  for  5  minutes,  cooled  to  45°C  for  5  minutes,  and  finally  centrifuged  at 
maximum  speed  for  1  minute.  After  injecting  80  |iL  of  the  hybridization  cocktails,  arrays  were 
incubated  for  17  hours  in  a  hybridization  oven  set  to  a  temperature  of  45°C  with  60  rpm 
rotation.  Arrays  were  washed,  stained  and  processed  using  Affymetrix  GeneChip  Fluidics 
Station  450  systems  after  which  they  were  imaged  using  Affymetrix  GeneChip  Scanner  3000  7G 
for  subsequent  generation  of  raw  data  (*CEL  fdes).  Raw  data  were  quantified  using  Robust 
Multichip  Array  (RMA)  background  correction,  quantile  normalization  and  RMA  probe-level 
models  (RmaPlm)  summarization  methods.  MIAME  compliant  metadata,  normalized  expression 
values  and  391  CEL  files  were  submitted  to  the  gene  expression  omnibus  (GEO)  (  samples 
GSM992943-GSM993345,  series  GSE40407).  After  data  preprocessing  and  normalization,  a  log 
base  2  transformation  was  applied.  Pathways  and  gene-interaction  network  analyses  were 
performed  using  the  commercially  available  software  Ingenuity  Pathways  analysis.  All  details  of 
the  microarray  analysis  including  pairwise  analysis  of  adjacent  and  contralateral  airways  for 
patient  clustering  are  included  in  the  Supplementary  Information  and  in  the  four  supplementary 
sweave  reports  accompanying  the  manuscript. 

Immunohistochemical  analysis  of  airway  biopsies.  Immunohistochemistry  was  done  on 
histological  sections  of  4  micron  formalin-fixed  paraffin-embedded  tissue  samples  prepared  by  a 
tissue  arrayer  as  described  previously  (28).  Immunohistochemistry  analysis  was  performed  using 
purified  rabbit  polyclonal  primary  antibodies  raised  against  phospho-AKT(Threonine308)  (1:200 
dilution,  clone  C31E5E,  catalog  number  2965)  and  phospho-ERKl/2(Thr202/Tyr204)  (1:400 
dilution,  clone  D13.14.4E,  catalog  number  4370)  (Cell  Signaling  Technology,  Danvers,  MA). 
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Antigen  retrieval  was  performed  using  the  Dako  target  retrieval  system  at  a  PH  of  6  (Dako, 
Carpinteria,  CA).  Intrinsic  peroxidase  activity  was  blocked  by  3%  methanol  and  hydrogen 
peroxide  for  1 5  min  and  serum- free  protein  block  (Dako)  was  used  for  7  min  for  blocking  non¬ 
specific  antibody  binding.  Slides  were  then  incubated  with  the  antibodies  against  phospho-AKT 
and  phospho-ERKl/2  at  room  temperature  for  90  and  120  min,  respectively.  After  three  washes 
in  Tris-buffered  saline,  slides  were  incubated  for  30  min  with  Dako  Envision+  Dual  Link  at  room 
temperature.  Following  three  additional  washes,  slides  were  incubated  with  Dako  chromogen 
substrate  for  5  min  and  were  counterstained  with  hemotoxylin  for  another  5  min.  Formalin-fixed 
and  paraffin-embedded  pellets  from  lung  cancer  cell  lines  displaying  positive  phospho-AKT  and 
phospho-ERKl/2  expression  by  Western  blot  analysis  were  used  as  a  positive  control,  whereas 
samples  and  whole-section  tissue  specimens  processed  similarly,  except  for  the  omission  of  the 
primary  antibodies  were  used  as  negative  controls.  The  intensity  and  extent  of  cytoplasmic  and 
nuclear  phospho-AKT  and  phospho-ERKl/2  immunostaining  was  evaluated  using  a  light 
microscope  (magnification,  x20)  independently  by  two  pathologists  (J.F.  and  I.I.W.)  who  were 
blinded  to  the  identity  of  the  samples.  Cytoplasmic  expression  was  quantified  using  a  four-value 
intensity  score  (0,  none;  1,  weak;  2,  moderate  and  3,  strong)  and  the  percentage  (0-100%)  of  the 
extent  of  reactivity).  A  final  cytoplasmic  expression  score  was  obtained  by  multiplying  the 
intensity  and  reactivity  extension  values  (range,  0-300).  Nuclear  expression  score  was  quantified 
using  the  percentage  of  extent  of  reactivity  (range,  0-100). 

Summary  statistics,  including  frequency  tabulation,  means,  standard  deviations,  median 
and  range  were  given  to  describe  subject  characteristics  and  immunohistochemical  protein 
expression.  Repeated  measures  analysis  was  performed  to  assess  the  differential  effect  of  sites  on 
phosphorylated  AKT  and  ERK1/2  expression  variation  with  time.  Mixed  effects  models  were 
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generated  to  assess  significance  of  site,  time  and  the  interaction  of  both  factors  to  the  expression 
of  both  proteins.  All  statistical  tests  were  two-sided  and  p-values  of  0.05  or  less  were  considered 
to  be  statistically  significant.  Statistical  analysis  was  performed  with  standard  statistical 
software,  including  SAS  Release  8.1  (SAS  Institute,  Cary,  NC)  and  S-Plus  2000  (Mathsoft  Inc., 
Seattle,  WA). 

Results 

Detailed  site-  and  time-dependent  airway  sampling  of  the  field  of  injury  in  early  stage 
NSCLC  patients  after  definitive  surgery 

Expression  patterns  molecularly  exemplifying  the  impact  of  smoking  on  the  airway 
epithelium  of  cancer- free  individuals  have  been  characterized  (8,  19,  29).  Moreover,  molecular 
abnormalities  typically  found  in  lung  tumors  have  been  detected  in  normal  resected  margins 
suggestive  of  a  field  effect  (15,  25-27,  30,  31).  However,  the  biological  nature  and  clinical 
relevance  of  the  field  of  injury  in  particular  after  complete  removal  of  the  tumor  in  early  stage 
NSCLC  patients,  who  are  increasing  in  number  and  for  whom  there  are  no  chemoprevention 
strategies,  is  yet  unknown.  Smoker  early  stage  NSCLC  patients  were  recruited  on  a  prospective 
phase  II  surveillance  clinical  trial  that  included  frequent  computed  tomography  (CT)  scans, 
serologies  and  annual  bronchoscopies  in  which  airway  brushings  and  biopsies  were  obtained 
within  one  year  following  tumor  resection  and  at  12,  24  and  36  months  following  the  first  time 
point  (Supplementary  methods  and  Figure  1).  The  first  time  point  bronchoscopies  were  all 
performed  within  one  year  of  definitive  surgery  (average,  6  months;  range,  1-12).  Nineteen 
patients  were  selected  for  the  study  (Table  1)  based  on  airway  sampling  of  at  least  five  different 
sites  per  time  point  and  continuously  up  to  24  or  36  months  giving  rise  to  391  airway  samples  for 
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transcript-level  expressing  profiling.  The  patients  were  accrued  between  2004  and  2008  and  were 
comprised  of  former  (n=16)  and  current  (n=3)  smokers.  Brushings  and  biopsies  were  obtained 
from  the  main  carina  (MC),  airways  relatively  adjacent  (ADJ)  to  the  previously  resected  tumor 
(ADJ)  and  from  airways  more  distant  from  the  tumor  in  the  ipsilateral  (NON-ADJ)  and 
contralateral  (CONTRA)  lung  (Figure  1). 

Following  normalization  of  the  raw  expression  data,  a  mixed-effects  model  was  applied 
to  characterize  the  expression  pattern  of  each  gene  that  incorporated  fixed  effects  such  as  the  site 
from  the  tumor  and  time  after  surgery  of  the  collected  airway  samples  (Supplementary 
information).  Histogram  p-value  distribution  plots  after  fitting  beta-uniform-mixture  (BUM) 
models  (32)  for  derived  p-values  based  on  the  fixed  effects  suggested  that  both  site  and  time  of 
the  airway  samples  influenced  gene  expression  modulation  (Supplementary  Figures  1A  and  IB). 

Site-dependent  differential  expression  patterns  in  airway  epithelia  of  early  stage  NSCLC 
patients 

We  first  sought  to  determine  whether  gene  expression  profiles  are  differentially 
expressed  in  airways  by  site  from  the  tumor  including  those  relatively  adjacent  to  the  resected 
tumors  compared  to  more  distant  airways.  Based  on  the  generated  BUM  models  and  p-value 
distributions,  genes  differentially  expressed  by  site  were  selected  based  on  a  1%  false  discovery 
rate  (FDR).  We  identified  1165  gene  features  that  were  statistically  significantly  differentially 
expressed  by  site  (Supplementary  Table  1).  Two-dimensional  hierarchical  clustering 
demonstrated  that  the  airway  samples  were  divided  into  two  main  branches  or  clusters  (Figure 
2A)  based  on  expression  of  the  genes.  Moreover,  the  left  cluster  in  the  indicated  heat  map’s 
dendogram  contained  a  statistically  significantly  higher  number  of  adjacent  airway  samples 
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compared  to  the  right  branch  which  contained  a  significantly  higher  proportion  of  main  carinas 
and  contralateral  to  the  tumor  airways  (p=0.0027  of  the  Fisher’s  exact  test  for  count  data).  In 
addition,  the  two-dimensional  clustering  revealed  eight  different  gene  expression  patterns  which 
are  indicated  by  the  color  bar  and  code  along  the  left  side  of  the  heat  map  (Figure  2A).  Some  of 
the  different  gene  clusters  or  classes  were  associated  with  a  specific  group  of  airway  samples. 
Notably,  a  cluster  of  263  genes  (cluster  1,  Supplementary  Table  1),  indicated  by  the  dark  green 
color  and  asterix  on  the  heat  map,  was  found  to  have  highest  average  expression  in  adjacent 
airways  (Figure  2A  and  Supplementary  Table  2).  In  contrast,  another  cluster  of  348  genes 
(indicated  by  magenta  color)  appeared  to  be  highest  in  expression  in  main  carinas  (Figure  2A). 

We  then  determined  to  functionally  analyze  the  cluster  of  genes  (n=263),  that  was  found 
to  exhibit  the  highest  average  expression  in  adjacent  airways,  between  adjacent  and  contralateral 
airways.  Functional  pathways  analysis  using  Ingenuity  Pathways  Analysis  (IP A)  depicted  several 
significantly  modulated  pathways  and  molecular  functions  in  particular,  T-cell  (p=1.2  x  10'9), 
chemokine  C-C  motif  receptor  5  (CCR5)  (1.5  x  10'9)  and  phospholipase  C  signaling  (p=3.6  x  10' 
9).  In  addition,  topological  gene-interaction  network  analysis  highlighted  functionally  modulated 
and  up-regulated  gene  networks  mediated  by  phosphoinositide  3 -kinase  (PI3K)  and  extracellular 
regulate  kinase  (ERK)  in  the  adjacent  airways  (Figure  2B).  It  is  worthwhile  to  note  that  we  also 
observed  increased  modulation  of  ERK  MAPK-mediated  gene-interaction  network  using  a 
different  analytical  method  where  we  compared  expression  profiles  between  adjacent  and 
contralateral  between  patients  in  a  pairwise  fashion  (data  not  shown,  Supplementary  sweave 
report  3).  These  findings  suggest  that  airway  site-dependent  differential  gene  expression  profiles 
in  early  stage  NSCLC  patients  exhibit  increased  molecular  features  and  gene-interaction 
networks  typically  associated  with  cancers. 
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We  then  determined  to  analyze  gene  expression  profiles  while  excluding  main  carinas 
since  their  epithelial  anatomy  is  suggested  to  be  different  from  that  of  other  airways  (33)  and 
thus  may  confound  site-dependent  observations.  Following  exclusion  of  main  carinas,  we  found 
a  reduced  number  of  genes  (n=136)  that  were  significantly  modulated  by  site  in  the  field  of 
injury  (Supplementary  sweave  report  4).  Two-dimensional  hierarchical  clustering  demonstrated 
that  airway  samples  were  divided  into  two  main  branches  or  clusters  (Supplementary  Figure  2) 
with  significantly  more  adjacent  airways  in  the  right  branch  (p=0.0002  of  the  Fisher’s  exact  t- 
test).  Moreover,  the  differentially  expressed  genes  (Supplementary  Table  4)  were  comprised  of 
two  main  subgroups  with  one  cluster  (top  cluster)  of  113  genes  exhibiting  highest  average 
expression  in  adjacent  airways.  It  is  important  to  note  that  when  we  cross-compared  gene 
clusters  that  we  had  found  to  exhibit  the  highest  average  expression  in  adjacent  airways  when 
including  (263  gene  cluster)  or  excluding  (113  gene  cluster)  main  carinas,  we  found  a  highly 
significant  overlap  (p=2.46  x  10~191)  in  the  number  of  genes  (n=96).  Moreover,  the  site- 
dependent  genes  identified  after  exclusion  of  main  carinas  were  also  topologically  organized 
following  functional  pathways  analysis  into  interaction  networks  involving  PI3K  and  ERK. 

Gene  expression  profiles  in  the  lung  airway  epithelia  of  early  stage  NSCLC  patients  are 
modulated  with  time  following  definitive  surgery 

We  then  determined  to  identify  genes  that  were  differentially  expressed  with  time.  Based 
on  the  generated  BUM  models  and  p-value  distributions,  time-dependent  differentially  expressed 
genes  were  identified  and  selected  based  on  a  5%  FDR  cut-off  (n=1395,  Supplementary  Table  4 
and  Supplementary  sweave  report  2).  Hierarchical  clustering  of  samples  indicated  that  the 
dendogram’s  main  branches  were  statistically  significantly  unbalanced  with  respect  to  time;  the 
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main  left  branch  comprised  a  higher  number  of  24  and  36  month  time  points  compared  to  the 
right  cluster  (p=4.15  x  10"7  of  the  Fisher’s  exact  test  for  count  data).  Two-dimensional  clustering 
of  both  genes  and  samples  revealed  two  main  classes  of  genes,  those  that  displayed  increased 
(upper  cluster)  and  decreased  (lower  cluster)  expression  with  time  (Figure  3A).  Functional 
pathways  and  gene-interaction  network  analysis  of  genes  differentially  expressed  genes  between 
36  months  and  the  first  time  point  revealed  statistically  significantly  modulated  pathways,  in 
particular  protein  ubiquitination  (5.3  x  10‘5),  glutathione  metabolism  (6.0  x  10~5),  mitochondrial 
dysfunction  (1.4  x  10~4)  and  oxidative  phosphorylation  (2.9  x  10~3)  as  well  as  eukaryotic 
initiation  factor  2  (eIF2)  signaling  (2.6  x  10'3).  In  addition,  network  analysis  highlighted 
functionally  modulated  and  elevated  gene-interaction  networks  with  time  in  particular  those 
mediated  by  Nuclear  factor-kappa  B  (NF-kB),  ERK,  AKT  and  cyclin-Bl  (CCNB1)  (Figure  3B). 

We  then  sought  to  determine  the  relationship  of  genes  that  were  significantly  modulated 
by  site  and  time  in  the  molecular  field  of  injury.  A  smooth  scatter  plot  of  transformed  p-values 
indicated  that  site-  and  time-dependent  expression  modulations  were  largely  independent 
(Supplementary  Figure  3  and  Supplementary  sweave  report  2).  We  then  cross-compared  the  site- 
dependent  (n=1165)  and  time-dependent  (n=1395)  profiles  that  we  had  noted  in  the  molecular 
field  of  injury.  Using  hypergeometric  tests  for  overlapping  probability,  we  found  no  significant 
overlap  between  genes  we  had  determined  to  be  differentially  expressed  by  site  and  time  in  the 
molecular  field  of  injury  (p=0.865)  (Supplementary  sweave  report  4). 
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Increased  expression  of  phosphorylated  AKT  and  ERK1/2  in  airway  epithelia  by  site  from 
the  tumor  and  with  time  following  surgery 

Our  findings  on  the  modulation  of  PI3K-  and  ERKl/2-mediated  networks  by  site  and 
time  after  surgery  prompted  us  to  examine  the  immunohistochemical  (IHC)  expression  of 
surrogate  markers  of  both  signaling  cascades  in  corresponding  formalin- fixed  paraffin  embedded 
(FFPE)  airway  biopsies.  We  sought  to  examine  expression  of  AKT  phosphorylated  at 
Threonine308  since  phosphorylation  of  this  amino  acid  is  well  known  to  occur  through 
phosphoinositide-dependent  kinase  1  (PDPK1)  following  PI3K  activation  (34).  We  assessed  by 
IHC  the  immunohistochemical  expression  of  phospho-AKT(Thr308)  and  phospho- 
ERKl/2(Thr202/Tyr204)  in  available  and  evaluable  histologically  normal  bronchial  epithelia 
biopsies  (n=324)  corresponding  to  the  brushings  analyzed  by  expression  profiling. 
Immunoreactivity  of  phospho-AKT  (min  0,  max  300)  was  variable  as  depicted  in  the 
representative  photomicrographs  in  Figure  4A  and  was  detected  in  both  the  cytoplasm  and 
nucleus  of  normal  bronchial  epithelia  (NBE)  (Figure  4A).  IHC  analysis  demonstrated  that 
cytoplasmic  (p<0.0001)  and  nuclear  (p=0.01)  phospho-AKT  statistically  significantly  increased 
with  time  up  to  three  years  in  all  NBE  (Figure  4B)  with  highest  expression  at  the  36  month  time 
point.  Nuclear  phosphorylated  AKT  was  also  statistically  significantly  increased  in  adjacent 
NBE  compared  to  relatively  airways  more  distant  from  tumors  in  the  mixed-effects  model 
(p=0.04)  (Figure  4B). 

Immunoreactivity  of  phospho-ERKl/2  was  also  variable  (min  0,  max  209)  and  mainly 
cytoplasmic  (Figure  4C).  IHC  analysis  demonstrated  that  phosphorylated  ERK1/2  was 
statistically  significantly  elevated  in  adjacent  NBE  (p=0.03)  and  significantly  increased  with  time 
up  to  three  years  in  all  airways  when  averaged  together  (p=0.02)  (Figure  4D)  in  the  model. 
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Notably,  there  was  a  significant  interaction  (p=0.019)  between  site  of  NBE  and  time  of  sampling 
as  phospho-ERKl/2  expression  was  significantly  increased  in  adjacent  NBE  but  not  in 
contralateral  airways  and  main  carinas  in  the  model  (Figure  4D)  with  highest  expression  in 
adjacent  airways  and  non-adjacent  (ipsilateral,  green  plot)  airways  in  observed  at  the  latest  time 
point.  Similar  data  were  obtained  when  we  excluded  main  carinas  in  the  mixed-effects  model 
(data  not  shown).  Moreover,  we  noted  similar  findings  when  all  airway  samples  were  analyzed 
irrespective  of  the  presence  of  preneoplastic  lesions  (e.g.  dysplasias). 

These  data  demonstrated  that  like  differential  gene  expression  profiles  within  the  lung 
airway  field  of  injury,  canonical  activated  oncogenes  are  modulated  by  site  from  the  resected 
tumor  and  time  following  definitive  surgery  in  early  stage  NSCLC  patients. 

Discussion 

In  this  report,  we  characterized  differential  expression  profiles  and  protein  expression 
within  the  lung  airway  field  of  injury  of  early  stage  NSCLC  patients  by  site  from  the  tumor  and 
time  in  years  following  surgery.  We  demonstrated,  and  to  our  knowledge  for  the  first  time,  that 
gene  expression  profiles  in  histologically  normal  airways  of  early  stage  NSCLC  patients  are  non- 
uniform  by  site  and  are  modulated  with  time  up  to  three  years  following  surgery.  Moreover, 
functional  analysis  of  the  expression  profiles  demonstrated  that  canonical  expression  patterns 
and  protein  kinase  activation,  typical  of  tumors,  are  increased  in  airway  sites  adjacent  to  tumors 
as  well  as  remain  or  are  further  modulated  in  the  lung  airway  field  of  injury  for  three  years  after 
definitive  surgery.  In  particular,  phosphorylated  ERK  MAPK  and  AKT  expression  were  up- 
regulated  in  normal  bronchial  epithelia  with  time  and  by  site  from  tumors.  In  light  of  the  growing 
sub-population  of  early  stage  NSCLC,  our  findings  are,  in  part,  proof  of  principle  and  raise  the 
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intriguing  possibility  of  the  importance  of  intense  surveillance  and  molecular  characterization  of 
the  remaining  smoking-injured  airway  epithelia  and  its  potential  integration  in  the  future  into 
clinical  practice  and  management  of  early  stage  disease.  However,  it  is  noteworthy  that  our 
study’s  patient  cohort,  despite  its  uniqueness  in  which  expression  profiling  was  performed  on 
airways  from  multiple  sites  collected  during  bronchoscopies  serially  performed  for  36  months 
following  surgery,  is  of  limited  size.  Moreover,  the  reported  findings  warrant  the  need  for 
validation  or  confirmation  in  independent  larger  sets. 

There  is  a  large  body  of  evidence  that  patients  who  have  survived  an  upper  aerodigestive 
cancer  comprise  a  high  risk  population  that  may  be  targeted  for  early  detection  and 
chemoprevention  efforts  (5,  31).  Currently  there  are  no  established  adjuvant  treatments  in  the 
tertiary  prevention  setting  of  early  stage  NSCLC  patients.  It  has  been  suggested  that  failures  in 
advances  of  chemoprevention  are  in  part  due  to  lack  of  clear  and  specific  molecular  targets  (5, 
35).  Our  extensive  profiling  of  the  molecular  field  of  injury  in  NSCLC  patients  identified  cancer- 
associated  pathways  (PI3K  and  ERK)  aberrantly  regulated  in  normal  bronchial  epithelia  of 
NSCLC  patients  after  tumor  removal.  In  this  context,  it  is  plausible  to  suggest  that  a  thorough 
characterization  of  the  molecular  field  of  injury  in  early  stage  NSCLC  patients  can  aid  in 
identification  of  aberrantly  expressed  pathways,  e.g.  PI3K  and  ERK,  which  could  potentially 
serve  as  suitable  targets  for  chemoprevention.  However,  it  cannot  be  neglected  that  the 
alternative  hypothesis  can  counter  argue  that  activation  of  PI3K  and  ERK  MAPK  pathways  may 
be  beneficial  for  chemoprevention  since  markers  of  both  pathways  (phosphorylated  AKT  and 
ERK)  increased  following  surgical  tumor  resection.  Our  suggestion  that  such  pathways  may 
serve  as  chemoprevention  targets  should  be  interpreted  cautiously  and  is  presented  owing  to  the 
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known  promalignant  function  of  the  pathways  and  gene  networks  highlighted  in  our  analysis 
(36). 

The  first  time  point  brushings  in  this  study  were  obtained  within  one  year  of  patient 
accrual  and  not  at  the  time  when  the  tumor  was  still  in  situ.  The  variable  starting  time  point  from 
surgery  among  patients  is  a  limiting  factor  in  our  analysis  since  it  is  plausible  to  assume  that  the 
molecular  field  of  injury  may  vary  upon  tumor  removal.  We  were  not  able  to  avoid  this  caveat 
given  the  difficulty  of  accrual  of  patients  following  surgical  tumor  resection  to  obtain  bronchial 
brushings  at  six  different  anatomical  sites  in  the  bronchial  tree  annually  up  to  three  years. 
However,  it  is  important  to  mention  that  the  time-dependent  gene  expression  profiles  we  had 
identified,  despite  not  incorporating  the  molecular  field  effect  at  the  time  the  tumor  was  present, 
demonstrated  gradual  changes  in  expression  with  time.  This  effect  was  also  evident  when 
analyzing  the  immunohistochemical  expression  of  phosphorylated  AKT  and  ERK  in 
corresponding  bronchial  biopsies  with  highest  expression  at  three  years.  Our  findings  warrant 
future  larger  studies  in  which  the  molecular  field  of  injury  at  the  time  the  tumor  is  still  in  situ  can 
also  be  serially  monitored  for  several  years. 

The  patient  population  we  had  studied  was  comprised  of  early  stage  NSCLC  patients, 
which  is  in  contrast  to  earlier  transcriptomic  studies  of  the  molecular  field  of  injury  that  focused 
on  phenotypically  normal  smokers  and  non-smokers  (19,  21,  22).  It  is  still  not  clear  whether  the 
differences  in  expression  described  in  this  study  reflect  an  already  present  gradient  field  of  injury 
that  may  have  contributed  to  tumor  development  in  light  of  the  differential  cancer-associated 
pathways  identified  or  one  that  arises  due  to  the  molecular  impact  of  the  tumor  on  the  adjacent 
field.  It  is  important  to  note  that  in  this  study,  the  spatial  and  temporal  molecular  field  of  injury 
in  lung  cancer  patients  was  profiled  prospectively  starting  within  one  year  following  definitive 
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surgery.  Thus,  the  above  speculation  may  be  addressed  by  a  similar  thorough  spatial  and 
temporal  characterization  of  the  molecular  field  of  injury  before  and  after  surgery  in  early  stage 
patients.  In  addition,  we  did  not  have  access  to  similar  type  of  brushings  from  cancer-free 
individuals  such  as  high-risk  smokers.  Similar  analysis  of  the  molecular  field  of  injury  in  cancer- 
free  individuals  will  shed  light  on  the  nature  of  site-  and  time-dependent  expression  patterns  in 
the  field  of  injury  and  whether  such  changes  in  cancer  patients  reflect  recovery  from  surgery 
(temporal  profiles)  or  are  a  cause  or  consequence  of  tumor  development  in  the  adjacent  field 
(spatial  profiles). 

In  conclusion,  our  unique  study  identified  gene  expression  profiles,  functional  gene- 
networks  and  activated  levels  of  oncogenic  protein  kinases  within  the  field  of  injury  of  early 
stage  NSCLC  patients  that  are  modulated  or  increased  in  airways  spatially  from  the  tumor  and 
temporally  following  surgery.  Moreover,  the  herein  previously  uncharacterized  airway  cancer- 
associated  expression  and  protein  kinase  alterations  harbor  potential  valuable  targets  for 
chemoprevention  and  warrant  confirmation  and  further  studies  in  larger  independent  cohorts. 
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Table  1.  Clinicopathological  features  of  NSCLC  patients  included  in  the  study 


Patient 

Histology 

Anatomical 

site 

Stage 

Recurrence 

Adjuvant 

chemotherapy 

Gender 

Age 

Smoking 

status 

Pack 

years 

Years 

quit 

smoking 

Months  to 
inclusion 
from 
surgery 

1 

ADC 

LUL 

IIA 

Yes 

No 

Male 

81 

Former 

100 

12.0 

1 

2 

ADC 

RLL 

IB 

No 

No 

Male 

64 

Former 

27 

19.1 

5 

3 

ADC 

RUL 

IB 

Yes 

Yes 

Male 

58 

Former 

12 

25.5 

7 

4 

SCC 

RLL 

IB 

Yes 

Yes 

Female 

62 

Former 

60 

7.6 

7 

5 

ADC 

RLL 

IA 

No 

No 

Female 

60 

Former 

31 

11.8 

6 

6 

ADC 

LLL 

IA 

Yes 

Yes 

Male 

62 

Former 

40 

5.8 

5 

7 

ADC 

RUL 

IB 

Yes 

Yes 

Male 

61 

Current 

70 

NA 

10 

8 

SCC 

LUL 

IB 

No 

No 

Male 

64 

Current 

100 

NA 

5 

9 

ADC 

RUL 

IA 

No 

No 

Male 

69 

Former 

12 

36.5 

8 

10 

ADC 

RUL 

IB 

No 

No 

Female 

63 

Former 

18 

8.6 

5 

11 

ADC 

RUL 

IB 

Yes 

No 

Female 

53 

Former 

42 

5.8 

3 

12 

SCC 

RUL 

IA 

No 

No 

Male 

62 

Former 

68 

0.4 

5 

13 

SCC 

LUL 

IA 

No 

No 

Male 

70 

Former 

96 

1.4 

7 

14 

ADC 

LUL 

IA 

Yes 

Yes 

Female 

45 

Former 

12.5 

0.9 

12 

15 

ADC 

RLL 

IA 

No 

No 

Female 

57 

Current 

80 

NA 

4 

16 

ADC 

LLL 

IA 

No 

No 

Male 

65 

Former 

48 

21.0 

7 

17 

ADC 

LLL 

IA 

No 

No 

Male 

64 

Former 

92 

5.1 

7 

18 

ADC 

RML 

IB 

No 

No 

Male 

71 

Former 

102 

2.3 

2 

19 

ADC 

LUL 

IA 

No 

No 

Female 

66 

Former 

50 

18.5 

5 

*  Anatomical  site:  location  of  primary  tumor  in  the  lung  (LUL,  left  upper  lobe;  LLL,  left  lower 
lobe;  RUL,  right  upper  lobe;  RML,  right  middle  lobe;  RLL,  right  lower  lobe 

*  Smoking  status:  smoking  status  at  time  of  inclusion  into  the  study  (patient  7  quit  smoking 
during  the  course  of  the  study). 

*  Years  quit  smoking:  Years  from  smoking  cessation  to  time  of  inclusion  into  the  study. 

*Months  to  inclusion  from  surgery:  Months  elapsed  from  surgery  to  time  of  inclusion  into  the 
study/first  bronchoscopy  time  point. 

ADC,  adenocarcinoma;  SCC,  squamous  cell  carcinoma 
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Figure  legends 

Figure  1.  Spatial  and  temporal  molecular  field  of  injury  in  early  stage  NSCLC  patients 
after  definitive  surgery.  Schematic  depicting  the  site-  (top)  and  time-  (bottom)  dependent 
collection  of  airway  epithelia  brushings  by  bronchoscopy.  Smoker  early-stage  NSCLC  patients 
were  enrolled  into  a  surveillance  clinical  trial  for  annual  follow-up  and  bronchoscopies  within 
one  year  after  definitive  surgery.  Bronchial  airway  epithelial  cells  (brushings  and  biopsies)  were 
obtained  from  five  to  six  different  sites  comprised  of  the  main  carina  (MC),  4  airways  from  4 
lobes  ipsilateral  (NON-ADJ)  or  contralateral  (CONTRA)  to  the  originally  resected  tumor  and  of 
the  bronchial  stump  or  main  stem  bronchus  adjacent  to  the  tumor  and  lobe.  All  site-different 
airway  epithelia  were  collected  at  the  time  of  inclusion  in  the  study  and  at  12,  24  and  36  months 
following  the  starting  time  point  (391  airways  from  19  NSCLC  patients). 

Figure  2.  Site-dependent  airway  epithelia  differential  gene  expression  patterns.  A.  Heat  map 
depicting  two-dimensional  clustering  of  airway  samples  (n=391)  and  genes  (n=1165)  that  were 
determined  to  be  differentially  expressed  by  site  in  the  mixed-effects  model  based  on  a  1%  FDR 
cut-off.  The  identified  eight  gene  clusters  are  labeled  with  different  colors  with  the  green  cluster 
of  genes  (n=263,  *A)  exhibiting  highest  average  expression  in  adjacent  airways  and  the  magenta 
cluster  (n=348,  *MC)  having  highest  expression  in  main  carinas.  B.  Gene-interaction  analysis  by 
IPA  depicting  networks  with  significant  scores  that  indicate  the  likelihood  of  genes  in  a  network 
being  found  together  than  due  to  chance.  The  depicted  networks  were  found  to  be  mediated  by 
PI3K  (top)  and  ERK1/2  (bottom)  with  both  kinases  themselves  not  modulated  in  expression. 
Gene  expression  variation  based  on  the  statistical  cut-off  described  above  is  depicted  by  color  in 
the  network  (red,  up-regulated;  green,  down-regulated). 
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Figure  3.  Temporal  modulation  of  the  molecular  field  of  injury  after  definitive  surgery  in 
early  stage  NSCLC  patients.  A.  Heat  map  depicting  two-dimensional  clustering  of  airway 
samples  (n=391)  and  genes  (n=1395)  that  were  determined  to  be  differentially  expressed  by  time 
in  the  mixed-effects  model  based  on  a  5%  FDR  cut-off.  B.  Gene-interaction  analysis,  similar  to 
that  in  Figure  2,  by  IPA  depicting  networks  with  increased  likelihood  of  genes  being  found 
together  than  due  to  chance  and  mediated  by  NF-kB,  ERK1/2,  AKT  and  CCND1.  CCND1  itself 
was  up-regulated  at  the  expression  level.  Gene  expression  variation  based  on  the  statistical  cut¬ 
off  is  depicted  by  color  in  the  network  (red,  up-regulated;  green,  down-regulated). 

Figure  4.  Site-  and  time-dependent  immunohistochemical  expression  of  phosphorylated 
AKT  and  ERK1/2  in  the  airway  field  of  injury.  A.  Representative  photomicrographs  (20x 
magnification)  depicting  strong  (top)  and  weak  (bottom)  phospho-AKT(Thr308) 
immunostaining.  B.  Immunohistochemical  scores  of  cytoplasmic  (left)  and  nuclear  phospho- 
AKT  (middle)  were  assessed  for  statistically  significant  differences  by  site  and  time  in  a  mixed- 
effect  model  and  plotted  in  main  carinas  (MC)  and  in  adjacent  (ADJ),  non-adjacent  (ipsilateral, 
NON-ADJ)  and  contralateral  (CONTRA)  airways  with  time.  C.  Representative 
photomicrographs  (20x  magnification)  depicting  strong  (top)  and  weak  (bottom)  phospho- 
ERKl/2(Thr202/Tyr204)  immunostaining.  D.  Immunohistochemical  scores  of  phosphorylated 
ERK1/2  levels  were  assessed  for  statistically  significant  differences  by  site  and  time  in  a  mixed- 
effect  model  and  repeated  measures  analysis,  site*time,  term  for  interaction  between  site  and 
time  factors.  Error  bars  represent  S.E.  of  the  mean. 
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